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PREFACE 


The rapidly decreasing time lag between scientific discoveries and appli¬ 
cations imposes ever-increasing demands on the mathematical equipment 
of scientists and engineers. Although the mathematical preparation of 
enginet'.ring students has been strengthened materially in the past thirty 
years, the introduction of courses beyond the tiaditional ^'terminal course^' 
in calculus has been largely confined to a few leading institutions. The 
reluctance to broaden significantly the program of instruction in mathe¬ 
matics can be attributed in part to the crowded engineering^ curricula, in 
part to the failure to sense the central position of mathemjrtics in sciences 
and technology, and in part to the scarcity of suitable staffs and instruc¬ 
tional media. The broadening, however, is inevitable, for it is now gen¬ 
erally recognized that no professional engineer can keep abreast of scien¬ 
tific developments without substantially extending his mathematical hori¬ 
zons. 

This book, in common with its predecessor written by the senior author 
some twenty-fn^e years ago, has as its main aim a sound extension of such 
horizons. The authors not only have Ix^n guided by their subjective 
appraisal of the live present-day needs of the engineering profession but 
have also taken into account the views of the leaders of engineering 
thought as expre.ssed in numerous conferences and symposia on engineer¬ 
ing education sponsored by the National Science Foundation, the American 
Society of Engineering Education, and its predecessor the Society for the 
Promotion of Engineering Education. 

There are many conflicting and often prejudiced currents of thought as 
to how mathematics should be presented to students of applied sciences. 
Some believe that mathematics is one whole and indivisible and hence 
sliould be presented unto all alike, regardless of the differing creeds. Others 
are content with a catalogue of useful formulas, rules, and devices for 
solving problems. The authors think that those two extreme viewpoints 
are somewhat limited, since they recognize only two of the many facets of 
mathematics. A preoccupation with the logic of mathematics and the over¬ 
emphasis of a convention called rigor are among the best known means for 
stifling interest in mathematics as a crutch to common sense. On the other 
hand, a presentation which puts applications above the medium making 
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PEEFACE 


applications possible is sterile, because it gives no inkling of the supreme 
importance of generalizations and abstractions in applications. The au¬ 
thors have tried to strike a balance which would make this book both a 
sound and an inspiring introduction to applied mathematics. 

The material in tliis book appears in nine chapters, each of which is 
complete and virtually independent of the others. Occasional cross refer¬ 
ences to other chapters are intended to correlate the topics and to enhance 
the usefulness of the book as a reference volume. P^ach chapter is sub¬ 
divided into functional parts, many of which also form an organized whole. 
The earlier parts of each chapter are less advanced and should seiwo as 
an introduction to more difficult topics treated in the later part/S. The 
text material set in small type usually deals with generalizations and de¬ 
velops the less familiar concepts which are sure to grow in import,ance in 
applications. 

Tfie choice of topics is based on the authors' estimate of the frequency 
with which the subjects treated occur in applications. The illustrative 
material, examples, and problems have been chosen more for their value 
in emphasizing the underlying principles than as a collection of instan(*es 
of dramatic uses of mathematics in specific situations confronting prac¬ 
ticing engineers. 

Although the book is written so as to require little, if any, outside help, 
the reader is cautioned that no amount of exposition can serve as a substi¬ 
tute for concentration in following the course of the argument in a serious 
discipline. In order to facilitate the understanding of the principles and 
to cultivate the art of formulating ph^^sieal problems in the langxjage of 
mathematics, numerous illustrative examples are worked out in detail. 
The authors believe with Newton that exempla nan minus doceunt quam 
precaepta, 

L Sokolnikoff 
R, Af. Redheffer 



TO THE INSTRUCTOR 


In the sense that a working course in calculus is the sole technical pre¬ 
requisite, this book is suitable for the beginner in applied mathematics. 
But when viewed in the light of the present-day requirements of the engi¬ 
neering profession, the text includes a large amount of material of direct 
interest to practicing engineers. 

It is certain that within the next twenty years the methods of functional 
analysis and, in particular, the Hilbert space theory will be in general use 
in technology. A foundation for the assimilation of the function-space 
concepts should be laid now, and we did not hesitate to do so in several 
places in this book. 

We liave arranged the contents in nine independent chapters which, in 
turn, are subdivided into parts, most of which can be read independently 
of the rest. The earlier parts of each chapter are less advanced, and our 
experience has shown that several introductory courses for students of sci¬ 
ence and technology can be based on the material contained in the earlier 
parts. When taken in sequence, this book has ample substance for four 
consecutive semester courses meeting three hours a week. 

This book is also suitable for courses in mathematical analysis bearing 
such labels as ordinary differential equations, partial differential equations, 
vector analysis, advanced calculus, complex variable, and so on. 

Thus Chap. 1, when supplemented by Secs. 12 to 14 of Chap. 2, has 
adequate material for a solid semester course in ordinary differential equa¬ 
tions. Instmctors wishing to include an introduction to numerical meth¬ 
ods of solutions of differential equations will find suitable material in Secs, 
14 to 18 of Chap. 9. The use of Laplace transforms in solving differential 
equations is discussed in Appendix B, which includes, among other things, 
a meaningful introductory presentation of the “Dirac delta function.’” 

Chapter 6, together wdth Secs. 18 to 25 of Chap. 2, has ample material 
for a semester course in partial differential equations. 

Chapters 4 and 5 have sufficient content for a modern course in vector 
analysis. 

Chapter 7, preceded by the relevant topics on line integrals in Chap. 5, 
is adequate for an introductory course in complex variable theory. 

Chapter 8 can be used in a semester course on probability theory and 
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TO THE INSTRUCTOR 


applications meeting two hours a week. A course entitled “Probability 
and Numerical Methods^^ meeting three hours a week can be based on 
the material in Chaps. 8 and 9. 

Although this book was written primarily for students of physical sci¬ 
ences, it is unlikely that a liberal arts student who followed it in an ad¬ 
vanced calculus course would be obliged to “unlearn’^ anything in his 
subsequent studies. 

The contents of this book include what we believe should be the mini¬ 
mum mathematical equipment of a scientific engineer. It may not be out 
of place to note that the mathematical preparation of physicists and engi¬ 
neers in Russia exceeds the minimum laid down here. While the curricula 
of only a few leading American engineering colleges provide now for rnon^ 
than one year of mathematics beyond calculus, their number will continue 
to increase with tlie realization that the time allotted to mathematics is a 
sound capital investment, yielding excellent returns both in the time gained 
in professional studies and in the depth of penetration. 
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The power and effectiveness of mathematical methods in the study of 
natural sciences stem, to a large extent, from the unambiguous language 
of mathematics, with the aid of wdiich the laws governing natural phe¬ 
nomena can be formiilaied. Many natural laws, especially those con¬ 
cerned with rates of change, can be phrased as equations involving deriva¬ 
tives or differentials. For example, when a verbal statement of Newton’s 
second law of motion is translated into mathematical symbols, there 
results an equation relating time derivatives of displacements to forces. 
A study of such equations then provides a com})lete qualitative and 
(quantitative characterization of the behavior of mechanical systems under 
the action of forces. Several broad types of equations studied in this 
book characterize physical situations of great diversity and practical 
interest. 

The first half of this chapter is con('emed with pnjliminaries and special 
techniciues devised for the solution of the first-order equations arising 
commonly in applications. The second half contains a comprehensive 
treatment of linear differential e(quations with constant coefficients and 
an introduction to linear equations with variable coefficients. Linear 
equations occupy a prominent place in the study of the response of elastic 
structures to impressed forces and in the analysis of electrical circuits and 
servomechanisms. They also appear in numerous boundary-value problcjms 
in the theor>^ of diffusion and heat flow, in quantum mechanics and fluid 
mechanics, and in electromagnetic theory. 


PRELIMINARY REMARKS AND ORIENTATION 

1. Definition of Terms and Generalities. Any function containing var¬ 
iables and their derivatives (or differentials) is called a differential expres¬ 
sion, and every equation involving differential expressions is called a 
differential equation. Differential equations are divided into two classes, 
ordinary and partial The former contain only one independent variable 
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and derivatives with respect to it. The latter contain more than one 
independent variable. 

The order of the highest derivative contained in a differential equation 
is called the order of the differential equation. Thus 



+ 5 ?/^ = 0 


is an ordinary differential equation of order 2, and 



dx dt 


+ yxi 


0 


is a partial differential equation of order 3. 

A function y = is said to be a solution of the differential e(|ualion 

P{x,y,i/) = 0, (M) 

if, on the substitution of ^ - ipix) and i/ ~ <p\x) in the left-hand meiiibef 
of (1-1), the latter vanishes identically.^ Again, y = (p{x) ls a solution 
of the second-order equation F(x,yj/,i/') = 0 when the substitui if)n 
y =» (p(x)f ?/ = <p\x), y" — (p"{x) reduces this to an identity in x. Simi¬ 
larly for equations of order n. 

For example, the first-order differential equation 

y' + 2ry - = 0 (1-L>) 

has a solution y ^ xe ^ , because the substitution of y ~ xe ^ and //' - 
in (1-2) reduces it to an identity 0^0. Also, the equation 

ir + y^o 

has a solution y — sin x, as can be easily verified by substitution. 

We begin our study of differential equations with the first-order eipiation 
(1-1), which w^e suppose can be solved for 1 / t,o yield tlie equation 

y' = f{^,y)- (lo) 

For reasons which will become clear presently, we shall always assum(' 
that f(x,y) is a continuous function throughout some region in the xy 
plane, and we shall study the solutions of (1-3) [or, equivalently, of (1-1)1 
in that region. 

The geometrical meaning of the term solution of (1-3) is suggest(‘(i at 
once by the interpretation of the derivative y' as the slope of the taiigcuit 
line to some curve y = ^(x), for if (x^y) is a point on the curve y = <^(x), 

^ Here, as elsewhere in this book, primes are used to denote differenliation: y' ss dy/dx, 
y” ^ d’^y/dx^, . .., 1/^"^ 
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and if at every point of this curve the slope is equal to /(x,t/), then is 

a solution of (1-3). 

One can get an idea of the shape of the curve y »= <p{x) in the following 
way: Ijct us choose a point (xo,^o) and compute 

y' “ fi^o.Vo)- (1-4) 

The number/(xo,2/o) determines a direction of the curve at (xo.yo)- Now, 

let {xi,yi) be a point near (xojyo) in the direction specified by (1-4). Then 
l/ ” f(x\,yi) determines a new direction at (xi^yi) (Fig. 1). Upon proceed¬ 
ing a short distance in this new 
direction, we select a new point 
{X 2 ,y 2 ) and at this point determine 
a new sl()i>e y' — /(x 2 ,i/ 2 )- As this 
process is continued, a curve is built 
up consisting of short line segments. 

If the succ.cssive points (oro^Vo), 

(3-1,yt), {x-i.y-i), ■■■, i^nyVn) are 

chosen near one another, the series 
of straight-hrie segments approxi- 
rnafes a smooth cur\"e y == <p{x) 
which is a solution of (1-3) associ¬ 
ated with the choice of (he initial point {xofVo)- A different choice of the 
initial point will, in general, give a different curve, so that the solutions of 
Eq. (1-3) can be view^ed as being given by a whole family of curves. Such 
curves are called integral curves^ and each curve in the family represents 
a particular soluhon or an integral of our equation. 

Also, we can make a surmise that, unless/(x,iy) in the right-hand member 
of (1-3) is a badly behaving function, for each choice of the initial point 
there will be just one solution of Eq. (1-3). This surmise is capable of 
proof, which we do not give here because it requires the use of analytical 
tools wdiich are not provided in the usual calculus courses. However, the 
statement of essential facts is easy to grasp, and since it will facilitate the 
understanding of subsequent developments, we give it here as a basic 
theorem. 

Existence and Uniqueness Theorem. The equation y' ^ S{x,y) has 
one and only one integral curve passing through each point of the region in 
which both J{x,y) and df/dy are continuous functions.^ 

Unless a statement to the contrary is made, we shall suppose that the 
restrictions imposed on f(x^y) in this theorem are fulfilled, so that Eq. 

^ It suffices to suppose thjxt \0f/dy\ is bounded in the region. Proofs of this theorem 
are contained in many books on differential equations, for example, E. L. Ince, *‘Ordi¬ 
nary Differential Equations," p. 62. also iS<‘c. 17 of this chapter. 
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(1-^) has a unique solution for each choice of (a;o,2/o) in appropriate 
region of the xy plane. 

Since by changing the initial value |/U«xg « y{x^) we get a family of 
curves depending on the arbitrarily chosen value y{xo)y the equation of 
fciiis family can be written in the form 

y = ip{xyc) (1^5) 

involving one arbitrary constant c, corresponding to the arbitrary choices 
of 2/(xo). A particular curve of the family (1-5) passing through (xo,2/o) 
is then detennined by the value of c such that //o = <p(xofC). 

A solution of the first-order equation (1-3) involving one arbitrary 
constant is called a general solution} Such solutions are often written in 
the implicit form 

= 0 , (!-()) 

where it is understood that (1-6) can be solved for y to yield the explicit 
form (1-5). In practice it may not be necessary to exhibit the explicit, 
form. The essential feature of the general solution [be it given by (1-5) 
or (1-6)] is that the constant c in it can be determined so that an integral 
curve passes through a given point (xo,t/o) of the region under consideral ion. 

We illustrate this b}^ demonstrating that throughout the xy plane the 
general solution of Eq, (1-2) can be written as 

y = e'~^\x -h c). (1-7) 

The fact that (1-7) is, indeed, a solution is easily verified by substituting 
(1-7) in (1-2), Moreover, it is a general solution, because on setting 
X ^ Xq and y « po we get 

Vo = e~~^{xo + c). (i-8) 

Thus the integral curve passing through (xo,?/o) corresponds to 

c = yoc/o - xo. 

As another example consider the equation 

dy 

(1-9) 

ax 

where /(x) is any continuous function. A general solution of this equation, 
obtained by direct integration, is 

y “ ffix) dx + c. (1-10) 

' Some first-order equations may have solutions which cannot be determined from 
the general solution for any value of e. Such solutions, called singular solutions^ arise 
only when the conditions imposed on f(x,y) in the basic theorem are not fulfilled. 
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We show next that (1-10) is a general solution of (1-9). We denote an 

indefinite integral in (1-10) by F(x), so that dFjdx = /(x). Then (1-10) 
is the same as 

y = F{x) + c. (1-11) 

On setting x ^ Xq, y ^ yo, we get 

2/0 = F{xo) + c, 
so that c ^ yo — F(xo), 

and we can, therefore, write (1-11) as 

y = F(x) - F(xo) + 2/0 
^Fix)\l + yo. (1-12) 

But from the fundamental theorem of integral calculus, 

rf(x)dx^F(x)r,, 

JZq 

and therefore (1-12) yields the desired particular solution 

y = f fi^r) dx -f j/o, (1-13) 

j Xu 

corresponding to the choice of the initial point (xo^2/o)« 

Formula (1-13) illustrates the procedure of deducing particular solutions 
by integrating the given equation (1-9) between limits. It is frequently 
simpler than the procedure of determining the desired solution by calculat¬ 
ing the constant c in the general solution from the initial data. 

The foregoing discussion can be extended to equations of higher order. 
Thus, the nth-order equation 

F(x,y,y',...,y^’''>) ^0, (1-14) 

which we shall write in the form solved for i/*"' as 

y(n) „ f{x,y,y',... ( 1 - 15 ) 

has a unique solution for n arbitrarily assigned initial values^ 

2/(a:o), ii''(a;o), • • •, i^o), (1-16) 

whenever the function f in (1-15) is continuous together with the partied 
derivatives df/Syy df/dy\ .. df/dy^^'^^\ 

When the values in (1-16) are varied, we get a family of curves, the so- 
called n-parameier family^ corresponding to n independent choices of 
constants in (1-16). The equation of this family of solutions can be written 
in the form 


y » <p{XfCu €$,,. ,,(v) 


(1-17) 
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mvolving n arbitrary constants c». A solution such as (1-17) is called a 
gemral solution of the nth-order equation (1-15) [or (1-14)], provided that 
the constants Ci in (1-17) can be determined for every given set of arbitrarily 
assigned initial values (1-16). The general solution (1-17) may also appear 
in an implicit form as 

. -. ,Cn) = 0, (1-18) 

which on solving for y should give (1-17). 

The meaning of the initial conditions (1-10), as they bear on the unique¬ 
ness of solution of the second-order equation = 0, is that the 

integral curve of this equation is determined at r — xq if the ordinate 
2/0 = y(^o) slope y*{xQ) are specified. 

To determine uniquely the solution of the third-order ecpuition, we must 
specify the value of the ordinate y^, the slope ?/[), and the value of the 
second derivative yl at x = xq. 

In the following nine sections we shall deal with first-order equations, 
which we can write in the differential notation as 

Pix,y) dx + Q{x,y) dy = 0. (1-19) 

If Q{x^y) 7 ^ 0, Eq. (1-19) gives 

dx Q{x,y) 

which is in the form (1-3) with/(x,y) = —F*{x,y)/Q{x,y). 


PROBLEMS 


Classify the following differential equations as ordinary or partial, and determine 
their orders: 


1 . 


da? \dx/ 


8. 3 /' -h sin y -h a; « 0; 

6. y" -f x^y' + a;y «» sin x; 


7. Vi + Vp - y'; 


2 . 


d^z 

da? 


+ 2 

dJ dy 


d^Z 

Oy* * 


4. dy * V^l ” dx; 



8. y'^ 4” y' ** y"'. 


Verify that the given expression is a solution of the given differential equation; 

9. y y' ^ y; 

10. 4 y' « - 1; 

11. y *» Cl sin j 4 C 2 cos x, y^' 4 y *“ 0; 

12. y tm Cl sinh x 4 ca cosh x, y" — y ^ 0; 

18. xy *• jf(x) dx, xy' 4 « fix), 

14. Integrate y' » 2x to show that its general solution is a family of parabolas y m 
4 c. Determine integral curves of this equation through (0,0), (1,1), (0,1), (1,-1). 
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16. determine the integral curve for j/" = 2r such that !/(0) — Oandy'fO) ■■ 1. Wliat 

the general solution of this equation? 

2. The Slipping of a Belt on a Pulley. To illustrate the prominence of 
differential equations in the study of various phenomena, this and the 
following three sections are primarily concerned with the task of setting 
up differential equations from })hysical principles.^ Such solutions as 
are incliuUid are intended merely as a preview of the systematic discussion 
given in the siibso(iuent sections. If he wishes, the reader may confine 
his attention to the derivation of the equations only and return to the 
question of solution after this systematic discussion has been assimilated. 

The first example is given by the 
bell-pulley arrangement of Fig. 2, 
whicli is now to be analj^zed Con¬ 
sider an element of the belt, of length 
As, which has end points Pand Q and 
subtends an angle A6 at the center 
0. Let T be the tension at P and 
T -f AT at Q, and let AF be the 
normal component of force on As 
due to the pulley. Thus AF is the 
component, along the radius OiV, 
of the total resultant force exclu¬ 
sive of T and 7' -f AT. 

Assume that the belt is stationary 
and that the pulley rotates, so that 
there is slipping, Since the element 
As IS in static e(iuilibrium, the components of force along ON must 
balance. This gives 

AO 

(T + AT) sin — + T sin ~ = AF, (2-1) 

2 2 

provided the weight of the belt is negligible or provided the pulley axis is 
vertical. Equating forces at right angles to ON leads to 

AO Ad 

(T + AT) cos-T cos — = g AF, (2-2) 

2 2 

where m is the coefficient of sliding friction.^ From (2-2) we may deduce 
AT -- g AF, Ad 0, (2-3) 

^ Further problems of the sort are treated in Secs. 18 to 20. 

* We define n by (2-2) and regard it as an experimental fact that ^ approaches the co¬ 
efficient of friction for flat surfaces or, at any rate, some limit independent of as 
Ae 0 . 


N 
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where the symbol ^ (read asymptotic to*') means ^ that the ratio of 
the quantities on each side tends to 1. Thus, if a fr, then lim a/b « L 
Equations (2-1) and (2-3) together show that AT —► 0 as —► 0. Since 
sin (A^/2) ^ A^/2, Eq. (2-1) now gives 

TAB^ AF, (2-4) 


Dividing (2-3) by (2-4) leads to AT/(T Ad) 

dT 


' Hy which becomes 


Tdd 




(2-5) 


since lim (AT/AS) = dT/dS, 

Separating the variables in (2-5) yields dT/T = n dBy which, upon in¬ 
tegration, becomes log T = + c. The initial condition T ^ Tq when 

^ « 0 gives c = log To, so that, taking exponentials, 

T « Toc^^ (2-G) 

PROBLEMS 

1 . Obtain Eq. (2-3) by equating torques about the point 0. 

8. If the pulley axis is horizontal, and if the Ixdt weighs w lb per ft, show that Eq. 
(2-3) becomes AF ^ AT — w As cose and Eq. (2-4) becomes AF T AB -h w As sin B, 
where AF is the normal component of the reaction of the pulley on As and the line OA 
in Fig. 2 is horizontal, with P above it. Deduce the difTerential equation dT/dB — tiT ^ 
tt?r(/< sin ^ 4- cos d), where r is the radius. 

S« Show that the equation in Prob. 2 becomes d(Te~^) » wre~^ (jj, sin B -f cos B) de 
when multiplied by and thus obtain the solution. 

8. Growth. Equation (2 -5), which was obtained for the tension in a 
slipping belt, arises in many other connections. For example, radium de¬ 
composes at a rate proportional to the amount present. If this amount 
is A at time t, the foregoing statement means 

dA 

— = -kA, fc > 0, (3-1) 

at 

the negative sign being chosen because A decre^es as t increases. A 
similar equation is followed by the growth of populations in certain cir¬ 
cumstances. Thus, the rate of increase being nearly proportional to the 
number N present, one can write dN/dt — kN, Again, certain organisms 

1 The relation symbolized by has many of the properties of strict equality. For 
example, if a 6 and h '^c then a c. To see this, observe that a/b —► 1, since 
a and b/c -+ 1, since b and hence, by multiplication, (a/b)(b/c) 1 *1. Thus 
a/e 1, which is to say, a'^c. The reader may verify similarly that o ^ 6 and 
c ^ d together imply ac ^ fed and a/c ^ b/d. Finally, if a fe and 6 is constant, we 
may write lim a * fe» These properties are freely used in the text. 
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SEC. 3 ] 

grow at a rate proportional to their size jS at a given time so that «» 

toS. 


Example 1. In a colony of bacteria each bacterium divides into two after a time inter¬ 
val, on the average, of length r. If there are n bacteria at time t « 0 and m at time t « 1, 
wiria n large, find the approximate value of r. 

The hypothesis implies that dN/dt « kN, approximately, with greater and greater 
accuracy as the number of bacteria N becomes large. Separating variables gives dN/N 
kdt. Now ( s* 0 corresponds to iV «» n, and f *« I corresponds to N ■■ m, by hy¬ 
pothesis. Thus, 



(3-2) 


and similarly 



( 3 ^) 


since N doubles in the interval r. Equation (3-2) gives log m — log n ** fc, and (8-3) 
gives log 2 *■ ktf so that 

log 2 

Tf m ---- 

log m — log n 


This problem illustrates the useful method of integraiion between limits for the determina¬ 
tion of constants. A justificati<m of this procedure is implicit in Sec. 1, Eq. (1-13). 

Example 2. A radioactive substance A decomposes into a new substance J5, which 
in turn decomposes into a tliird substance C. Set up a differential equation for the 
amount of B at time L 

The rate of increase of B is equal to the rate at which B is formed from A minus the 
rate at which B decomposes. Thus, denoting the amounts by A and 5, 


dt 


dt 


- kiB. 


(3-4) 


This equation has two unknowns, A and B. By (3-1), however, A «« so that 
(3-4) becomes 

dB 

— - «r kce~^* — kiB. (3-5) 


A method of solving (3-5) is given in Sec. 10. 


PROBLEMS 

1. If 3 g of a radioactive substance is present at time f «■ 1 and 1 g at ( «* 4, how much 
was present initially? 

2. In Example 2 of the text set up the differential equation for the amount of substance 
C present at time t, 

3. By actual substitution, determine o and p in such a way that B « is a solu¬ 
tion of Eq. (3-5). 

4 . The rate of decomposition of a certain chemical substance is proportional to the 

amount of the substance still unchanged. If the amount of the substance at the end of 
f hr is ac and xo is the initial amount, show that z m where k is the constant of 

proportionality. Find Jk if a; changes from 1,000 to 600 g in 2 hr. 
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6. A torpedo moving in still water is retarded with a force proportional to the veloc¬ 
ity. Find the speed at the end of t sec and the distance traveled in i sec if the initial 
speed is 30 mph. 

6. The rate at which a body is cooling is proportional to the difference in the tempera¬ 
tures of the body and the surrounding medium. It is known that the temperature of a 
body fell from 120 to 70°C in 1 hr when it was placed in air at 20°C. IIow long will it 
take the body to coolto 40*’C? 30°C? 20‘’C? 

7. The percentage of incident light absorbed in passing through a thin layeT of material 
is proportional to the thickness of the material. If 1 m. of material reduces the light to 
half its intensity, how much additional material is needed to reduce the intensity to 
one-eighth of its initial value? Obtain the answer by inspection, and check by solving 
an appropriate differential equation. 

4. Diffusion and Chemical Combination. Problems involving chemical 
reactions and the formation of mixtures often lead to differential equations; 
the discussion is similar to that of Sec. 3. For exami^le, suppose that a 
tank contains g gal of water and that brine containing w lb of salt per 
gallon flows into the tank and out again at a constant rate r gpm, starting 
at time ^ = 0. At the same time a piece of rock salt is dropped into the 
tank, where it dissolves at a constant rate of g lb per min. The mixture 
being kept uniform by stirring, it is required to find the amount of salt 
present at any time t > 0. 

This problem may be taken as the typical problem for many (piestions 
involving chemical reactions, mixing, and going into solution. The dif¬ 
ferential equation is obtained bj^ writing down the e(|uation of continuity 
(increase equals income minus outgo) for the amount of salt. Call this 
amount x = x{t) at time L In the time interval from t to t + the 
number of gallons entering the tank is r A^, since the rate of flow is r. 
Now each gallon contains w lb of salt. Hence the r A^ gal contains 

tor At (4-1) 

pounds of salt, and this, then, represents income due to the inflowing brine. 
The income due to the dissolving salt is 


q At, (4-2) 

by the definition of q. 

It remains to compute the amount of salt lost in the mixture leaving 
the system. The number of gallons leaving is r At^ the concentration of 
the mixture in pounds per gallon is x/g at time t, and hence the number 
of pounds leaving is 


-rAt (4-3) 

9 


Here £ denotes the mean value of x over the interval {t, t + At). We 
assume x to be continuous, so that 
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(4r4) 


lim X » T. 

A< 0 

From (4-1), (4-2), and (4-3) we obtain 


Ax «= w H- g -r A<, 

9 


which gives 

dx rx 

— = wr Q - 

dt g 

when we divide by A^ and let At 0, using (4-4). 


(4-5) 


Example: Find the concentration of salt at the end of 4 min when tc « 1, g? «« 2 
g =» 3, r 4. 

The differential equation IB djc/d/ * 7 ~ 2xordj-/(7 — 2x) » dt. Multiplying by —2 
and integrating give log (7 — 2 x) «= —21 -f c. Since x = 0 when i ~ 0, it is necessary 
that c =• log 7, so that — 2/ *= log (7 — 2wt:) — log 7 =* log (1 — 2x/7) or, taking exponen¬ 
tials, 1 - 2x/7 *= This gives the amount of dissolved salt x at the end of t min. 

Putting / = 4, solving for x, and noting that the concentration is not x but x/g give 
%{l - e"^) as the final answer. 


PROBLEMS 

1 . Solve the example of the text by the method of integration between limits. (See 
ITiXample 1, Sec. 3. Here issOatt^O, a:»*a:at<=»»4) 

2 . How would the disc.uasion in S<*c 4 change if the rock salt had been added at time 
t ^ io instead of time i ==* 0 ? 

3 . How would the discussion in 4 change if the rock salt dissolved at a rate pro¬ 
portional to the amount undissolved, rather than at the constant rate q‘! IhnV Tf A is 
this amount, dA/(i/ « —kA. From this find/I at time and from tliat find ^ =« —dA/dt. 

4 . Let A be the amount of a .substance at the beginning of a chemical reaction, and let 
X be the amount of the substance entered in the reaction after t sec. Assuming that tlic 
rate of change of the substance is proportional to the amount remaining, deduce that 
dx/dl ^ r(A — x), where c is a constant depending on the reaction. Show that x » 
A(1 - 

6 . Ixd, a solution contain two substances whose amounts expressed in gram molecules, 
at the beginning of a reaction, are A and B. If an equal amount x of both substances 
has changed at the time t, and if the rate of cliange is jointly proportional to the amounts 
of the substances remaining, obt/ain the equation dx/dt ^ k(A — x)(B — x). Solve, 
assuming that x ** 0 when / »= 0 . 

6 . Formulate the appropriate differential equation if the rate at which a. substance 
dissolves is jointly proportional to the amount present and to the difference between the 
actual concentration and the saturate concentration. 

The Elastic Curve. Consider a horizontal elastic beam under the 
action of vertical loads. It is assumed that all the forces acting on the 
beam lie in a plane containing the central axis of the beam. Choose the 
X axis along the central axis of the beam in undeformed state and the posi- 
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tave y axis down (Fig. 3). Under the action of external forces Fi the beam 
is bent and its central axis deformed. The deformed central axis, shown 
in the figure by the dashed line, is known as the elastic curve, and it is an 
important problem in the theorj’ of elasticity to determine its shape. 

A beam made of elastic material that obeys 
F, F* Hooke’s law is known to deform in such a way 

that the curvature K of the elastic curve is 
proporl.ional to the bending moment M. In 
fact, 

V'' M 

K = - - —-- (5-1) 

[1 + El 

Fig. 3 where E is Young’s modulus, / is the moment of 



inertia of the cross section of the beam about a 
horizontal line passing through the centroid of the section and lying in 
the plane of the cross section, and y is the ordinate of the claKstic curve. 
The important relation (5-1) bears the name BernouUi-Eiiler law. When the 
deflection of the beam is small, the slope of the elastic curve is also generally 
small and one can neglect the term (i/')^ in (5-1) to obtain an approxi¬ 
mate equation 


y 




M 

W 


(5-2) 


The bending moment M in any cross section of the beam is equal to 
the algebraic sum of the moments of all the forces acting on one side 
of the section. The moments of the forces F* are taken about a horizontal 
line lying in the cross section in question. 

Example: Consider a cantilever beam of length I built in at the end x « 0 and carry¬ 
ing in addition to a distributed load w(x) Ib per ft a concentrated load W Ib and a couple 
L ft-lb applied at the end x « I (Fig. 4). 

The resultant moment in a exoss section x ft from the end x « 0, produced by the 
loads acting to the nght of that section, is 

Mix) - / « - X)w{i) di + W(l -x)+L. 



Fio. 4 
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If w{x) 0 and L «• 0, this formula yields M «» W{1 — ®), and hencei, from (5-2), the 
differential equation of the central line of a cantilever beam subjected to the end load If is 




On integrating this equation we get 


^ \2 6 / 


-f CiX -f C2. 


The integration constants Cj and C 2 can be evaluated from the conditions 2/(0) ** 0, 
/y'(0) « 0, stating that the displacement and the slope of the central line vanish at the 
built-in end. It is readily checked that these conditions lead to 




so that the displacement d at the free end is d « Wl^/ZEL 


PROBLEMS 

1. A l>eam of length I is freely supported at its ends and is loaded in the center by a 
concentrated vertical load W, which is large in comparison with the weight of the beam 
(see Fig. 5). By symmetry, the behavior of this beam is the same as that of a cantilever 
b(‘am of length 1/2 loaded by a concentrated 

load of magnitude If/2 at its free end. Verify W 

this equivalence by direct computation of the 
elastic curve. Hint: 

^ ^ 0 < X --—]L— 

W W 

W I 2 2 

~ ’ 2 ^ ^ ^ ^ 

2. A uniform unloaded beam of length I weighs w lb per ft. Find the maximum de¬ 
flection when it is used as a cantilever beam and also when it is freely supported at each 
end. Hint: Since the reaction at the end x «* Hs H «« If/2, the moment in the cross 
section at a distance x from the end x » 0 is 


M ^ wj - x) di -(I 


THE SOLUTION OF FIRST-ORDER EQUATIONS 

6. Equations with Separable Variables. Generally speaking, the prob¬ 
lem of solving differential equations is a very difficult one. Even such a 
simple equation as y' == fiXfy) cannot be solved in general; that is, no 
formulas are available for solving the general differential equation of the 
first order. It is possible, however, to classify some of the firslK)rder dif- 
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fereatial equations according to several types and to indicate special 
methods of solution suitable for each of these. 

Promiiaent among these types are the equations with separable variables^ 
that is, equations which can be put in the form 


P(x) dx + Q{y) dy = 0, 


where P(x) is a function of x only and Q{y) is a function of y only. This 
type of equation has already been encountered in the s{>ecial examples 
solved above. Its general solution is 


Jp(x) dx + jQiy) dy = c, 


where c is an arbitrary constant. In order to obtain an explicit solution 
all that is necessary is to perform the indicated integrations. 

Example: Find a solution of xj + whi(‘h through (Od^i). 

The equation can be written as y' 4- e^{y — i/) ^ 0 or 


Integration gives 



-i- e* dx « 0. 


log - ^ c, 

1 - y 


which is a general solution. Putting x « 0, j/ « gives c * log 1 4- e® « 1, so that the 
required particular solution is 

log -h * 1. 

1 - ?/ 


PROBLEMS 

Solve the following differential equations. In Probs. 4 to 6 find a solution through the 
point (0,1). 

1. Vl — X* dj/ « V^l — dx. 2. y' * xy^ ~ x. 

A ^ ^ 4 ' 2 j 2 j 

3, y' K, -- 4, sin j; cos^ ydx^ cos^ x dy. 

sin y 

5. VTl- X dj/ « (1 4- 2/^) dx. 6. y' =» 

1 4-x 

7. Homogeneous Differential Equations. A function /(x,y) of the two 

variables x and y is said to be homogeneous of degree n provided that 

f(\z,\y) s X7(x,i/), X > 0. 

Thus, f{x,y) = x^ + x^y +■ y® is a homogeneous function of degree 3, 
and /(x, 2 /) = x^ sin (x/y) + xy k a homogeneous function of degree 2, as 
follows at once on replacing x by Xx and y by \y. 

If the differential equation is of the form 
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P{x,y) dx + Q{x,y) dy * 0, (7-1) 

where P{x^y) and Q{x^y) are homogeneous functions of the same degree, 
then (7-1) can be written in the form 


2 /'- 


P{x,y) 

Qi^yV) 




(7-2) 


where <#>(a:, 2 /) is a homogeneous function of degree zero; that is, 
<t>(\x,\y) s \^<t}(x,y) s 0 (x, 2 /). 

If X is set equal to l/x, then 


4p(x,y) ^ 4^i\x,\y) 



which shows that a homogeneous function of degree zero can always be 
expressed as a function of y/x. This suggests making the substitution 
y/x = V. Then, since y = vx^ 


dy 

dx 


dv 

— X + V, 

dx 


Substituting this value of dy/dx in (7-2) gives 

dv 

X^ + V ^ ft>{lyV), 
dx 


This equation is of the type considered in Sec. 6 . Separating the variables 
leads to 

dv dx 

0 ( 1 , 1 ;) — V X 

which can be integrated at once. 


Example: Solve 


+ X 


dy 

dx 


xy 


dx 


This equation can be put in the form 

dy_ __^ 

dx 


xy - X* y/x — 1 
Letting y/x « v and computing dy/dz from y ^ vx give 


t; -h X 


dv 

dx 




dv 


dx » — 1 


Separation of the variables leads to 

dx ^ 1 — 


dv ' 
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and integration yields log a; -f log » — » * c or log w; — t> -* c. Since » * y/x, the final 
answer is log y y/« •• a, 

PROBLEMS 

Solve the following differential equations. In Probs. 4 to 6 find a solution through 
the point (1^). 


1. (x® 4- y®) dy 4- 2 x 1 / dx ^ 0. 
« l/dy y 

3. X cos - —- « y COS- X. 

xdx X 

6* x^y dx » (x^ — y*) dy. 


2. zy' -- y ^ Vx* — y®. 

4^ (x 4" y)y' «* a: ~ y. 

. dy xy - y^ 


Some of the following equations are separable; some are homogeneous. Solve them. 


7* sinh X dy 4- cosh ydx » 0. 

9. x( Vxy 4- y) dx » x^ dy, 
11 . xy' « y 4- xe*'^*. 


8 . 

dx X \/xy 
10. x^y' — y^ « x^yy'. 

12. y' =* y' log y 4 tan x sec^ x. 


8 . Exact Differential Equations. An expn\s.si()n Pix/y) dx 4- Qix,y) dy 
is said to be exact if it coincides with the differential 

dF dF 

dF = — dx H- dy 

dx dy 

of some function F(x,y), that is, if 

dF dF . 

Pix,y) dx + Q{x,y) dy = — dx - dy. (8-1) 

dx dy 

In these circumstances the equation 

P(x,y) dx + Q(x,y) dy = 0 (8-2) 

is simply dF =» 0, and its general solution, therefore, is 

F{x,y) - c. (8^) 

When a function F(Xjy) satisfying the relation (8-1) e.xists, we conclude 
that 

dF dF 

— * Pix,y), — = Qix,y). (8-4) 

dx dy 

Moreover, if d^F/(dz dy) « d^F/(dy dx), we obtain by differentiating (8-4) 
a necessary condition, 

dP dQ 

— (§.5) 

dy dx 

for the existence of F{Xyy), This condition also suffices to construct F{x,y) 
in every rectangular region throughout which P, <2, dP/dy, and dQ/dx are 
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continuous.^ Indeed, on integrating the jfirst of Eqs. (8-4) with respect to 
Xf we get 

F(x,y) « f P(x,y) dx + /(y), (8-6) 


where f{y) is an arbitrary differentiable function of since y appearing 
in the integral of (8-6) is treated as a constant. We next determine f{y) 
so as to satisfy the second of Eqs. (8-4). Differentiating (8-6) with respect 
to y and equating the result to Q{Xyy) give 


so that 

This determines 


dF d 

— - - / Pix,y) dx +S\y) = Qix,y), 
f'iv) = Qix,y) - ^ / f’ix.y) dx. 

fiy) = / [Qix,y) ~ ^ / 


(8-7) 


provided the expression in the brackets in (8-7) is a function of y only. 
But that is always the case, since its derivative with respect to x is dQ/dx — 
dP/dy, and this vanishes whenever (8-5) holds Accordingly, the substitu¬ 
tion of f(y) from (8-7) in (8-6) gives the function F(x,y) and thus the de¬ 
sired solution F(x,y) = c. 


Example: Solve the equation 

(2xy + 1) dx + (x* -f iy) dy « 0. 


This equation is exact, since dP/dy « dQ/dx ** 2x, Thus there is a function F(x,y) such 
that 


dF 

dx 


2xy 4 - 1 , 


dy 


-f 4y. 


( 8 - 6 ) 


From the first of Eqs. (8-8) we conclude that 

F{x,y) - l(2xy + 1) dx +f(,y) 


« -fx 4-/(1/). (8-9) 

To satisfy the second of Eqs. (8-8), we must have 

riP 

— = +/'(y) “ I* + 4v 

dy 

SO that f'iy) » 4i/. 


The integration yields 

m « 2y\ 

and the substitution in (S-9) gives F{x,y) « x*|/ 4* x 4- 2y*. The desired solution, there¬ 
fore. is 

x\ 4- X 4- 2y* « c. 


^ For details and general discussion see Chap. 5, Sec. 9. 
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PROBLEMS 

Integrate the following equations if they are exact: 

1. {e* -f 3) fib + 0; 2. f 2a: -f - 1 

\ y / 2 

5. {$x^y — dr » (3iyb — x^) dy; 4, x dy y dx « 0; 

V y 1 .V 

6, cos - dx » - cos “ dy; 6. x dx y dy 0; 

X* X XX 

1. (3x^y - }/) rfx - (x" 4- 3,vv) rfv ■= 0; 

8. (y cos Ty + 2x) (ii + x cos xy dy = 0; 

9. (/ + 2iy + 1) rfx + (2xv + x») dy = 0; 

10. 3x^v rfx + (x® - 3y^x2) dy = 0. 

9. Integrating Factors. Suppose that 

M(x,y) dx + N{x,ij) dy = 0 (9-1) 

has a solution 

F(x,y) = c, (9-2) 

where F{x,y) is a differentiable function. On differentiating (9-2) with 
respect to a-, we get 

dF dF 

— + —!/' = 0 , ( 9 - 3 ) 

dx dy 

and from (9-1) we find 

M(x,y) + N{x,y)y' == 0. (9-4) 

The elimination of y' from (9-3) and (9-4) gives 

dF/dx dF/dy 

jr ,—7 “ 

M{x,y) N{x,y) 

where M(x,y) is the value of the common ratio. It follows from (9-5) that 

dF dF 

~ = M(a:,2/)i>^(a:,y), — = y{x,y)N{x,y) 

dx dy 

and hence that 

y(x,y)(M dx + N dy) =0 

is an exact equation; namely, it is the equation dF = 0. 

The function m(x,.?/) is termed an integrating factor. It is clear from the 
above disciission that every equation (9-i) has an integrating factor and, 
in fact, an unlimited number of them.^ Nevertheless, it must not be con-* 
eluded that an integrating factor can always be found easily. In simpler 
cases, however, it can be found by inspection. 

* Some integrating factors introduce extraneous solutions y which make nix^y) « 0 but 
do not satisfy (d-1). 
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Thus, in order to solve 

xdy — ydx ^ 0 

which is not exact as it stands, multiply both sides by l/xy. Then the 
equation becomes 

dy dx 

-^0, 

y X 


which is exact. Another integrating factor for this same equation is l/x^. 
Similarly, multiplication by 1/?/^ makes the equation exact. 

Example: Solve the differential equation 

{y^ — X*) dy 4' 2xy dx * 0. 

This is not an exact equation, hut on rearranf^enaent it becomes 
I/* dy 4- 2x1/ dx — dy « 0, 


which can be made exact with the aid of the integrating factor 1/y*. The resulting 
equation is 


which integrates to 


2xy dx - x^dy 

dy -h — -^-« 0 , 

r 


PROBLEMS 

The following problems give a few of the integrable combinations that commonly oc¬ 
cur in practice. Verify the equations by differentiating: 




xdy y dx 
X dy ~ y dx 


-(:)■ 


xdy-^ydx 

as ^ 

xy 

X dy — ydx 


5. )4 d{x^ 4~ y^) *= X dx 4“ y dy; 


6. d(xy) * X dy 4- y dx. 


Solve the following equations by finding a suitable integrating factor: 


7. X dy 4- dx ** y dx; 
9. X dy 4- 3y dx « xy dy; 
11. xdy — ydx ^ xy dy; 


8. (xy* 4- y) dx » (x*y ~ x) dy; 
10. (x* 4- y* 4" 2x) dy « 2y dx; 
12. (x* ~ y*) dy « 2xy dx. 


10. The First-order Linear Equation. An equation of the form 


+ M(x)y == N(x) 


is termed linear for reasons given in Sec. 21. 
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If we set y ^ w, where u and v are functions of a: to be determined later, 
we get on substitution in (lO-l) 

uv^ + vw' + Muv ^ iV, 

or v{u' + Mu) + uv' — N, (10-2) 

If u is suitably chosen, the parenthesis in (10-2) can he made equal to 
zero, thus reducing (10-2) to a simpler form. To this end, set 

a' + Mu - 0, (10-3) 

which is a separable equation for u. We get 

du 

-h M da- = 0, 

80 that log w + J M dx = c. (10-4) 

Since any solution of (10-3) reduces (10-2) to tlie form 

uv' - N, (10-5) 

we choose the simplest one, corresponding to c = 0. With tliis choice, 
(10-4) yields 

M = (10-G) 

and (10-5) becomes 

y' = AVJ(l0-7j 


Since the right-hand memljer in (10-7) d('p('n(K only on x, we get, on in¬ 
tegrating, 




dj + r. 


Kecalling the assumption that // = wr, we get the general solution 


y = c-/^^ <1^1 ATef" dx + 


Example 1. Solve y' -h yvoax « bin 2x. Here M{x) cobz and N{x) « sm 2 j. 


Since JAf dx =» Jwxdx =« (10-8) yicldb 

y «m e" ^ J ' sin S 


sin 2x dx -f re*” 


which IS easily evaluated by replacing sm 2x by 2 sm x cos x. 

Example 2. Solve {x + 1)//' *f 2j/ » (x 4- 1)^. Dividing by x 4* 1 shows that this 
equation is linear with M 2/{x -f I) and ** (x + 1)^. Hcn(‘© 


« (r +- ))-». 


while 
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Thus (10-8) yields 

y - (X + 1)-*J (X + I)‘dr + c(x + l)-» 


(X + D* 

6 


+ c(x + l)-“l 


PROBLEMS 


Solve the following equations. In Probs. 3 to 5 find a solution through the point 

( 0 - 1 ). 

1. (1 + x^) dy “ 2. (z^ -b l)y' + 2xy »* z®. 

8. y' « c~** — 2xy. 4. y' -i- xy — x 0. 

6. y' y cos X ** cos* X. 6, xy' 4- J/ ”» x^ sin x. 

7. Show, on writing Eq. (10*1) in the form 

dy + ^fy dx ^ N dx, 

that is an integrating factor of this equation, and thus obtain formula (10*8)* 

Solve the following equations, each of which is separable, homogeneous, exact, or 
linear. (It is instructive to use several methods w'hen piossiblc.) 


8. y' » y -f cos x — sin x. 
dx 

10. Y“ + ^ y 

dy 

12. v' yx ** //. 


g dy ^ y^ - xVx^ - 
’ dx xy 

11. x2(l -f 4y2) dx + 3yx* dy *• 0. 

18. dx + (1 -- e*') dy » 0. 

V 


11. Equations Solvable for y or y'. Certain special types of equations 
can be soIvcm] by writing p == dij/dx and expressing p as a function of 
X and y. Another metliod is to solve for y in terms of x and p and then 
differentiate with respect to x, using dy/dx = p. These procedures change 
the given first-order equation into a new one. 


Example 1. Solve 2p^ — (2y* 4* x)p + xy^ « 0, where p *= dy/dx. 

Factoring gives {p - y^){2p - x) « 0 so that, at each x, we have either p « y* or 
p « x/2. The fact that y is to be differentiable ensures that one or other of these rela¬ 
tions actually holds throughout an interval. Hence, w^ith p ^ dy/dx^ they can be re¬ 
garded as differential equations and solved in the ordinary way. From dy/dx » y* 
there results 

X 4“ - « Cl, (11-1) 

y 


and from dy/dx • x/2 is obtained 

X* 

y « 4- C 2 . 


( 11 - 2 ) 


Those two sets of curves represent the desired solution. Although there is no advantage 
in doing so, one may write (11-1) and (11-2) as a single equation with a single parameter, 
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Ewample 2. Solve p* •+- PJ/ **• 1, where p «= dy/dx. 

Since it is impractical to solve this equation for p to obtain p « fiy) (which would 
have led to a separable equation), we solve it for y and obtain 


y 



Differentiating (11-3) with respect to x leads to 


which can be written as 
After integration we get 


dy I dp ^dp 

dx ^ p^dx ^ dx* 


dx ^ dp — ip^ dp. 



(11-3) 


(11-4) 


which, together with (11-3), gives the desired solution in parametric form. There is no 
advantage in eliminating the parameter p in Eqs. (11-3) and (11-4), even when it is 
possible to do so. Plotting the curves representing the solution as p varies, one obtains 
not only the locus (x,y) but also the slope p at each point. 

The method used to solve the equation in the preceding example can be 
applied to solve the Lagrange eqvxition 

y = xS{y') + g{y'), (11-5) 

where / and g are differentiable functions of y' = p. On setting y' == p 
in (11-5) one obtains 

y == xf(p) + gip), ( 11 - 0 ) 


Differentiating with respect to x yields 


P 


dp dp 

xr(p)^ + f{p) + g^{p)^* 

dx dx 


wIlmAi can be written as 


dp p - Sip) p - Sip) 


(11-7) 


This equation is linear in x; that is, it is of the form dx/dp + Mip)x «= Nip), 
and it can be solved by the method of Sec. 10. Its solution for a; as a 
function of p, together with (11-6), yields the solution of the original 
equation in parametric form, with p as parameter. 

The reader will ihnd it instructive to apply this method to solve y « 
^ + (l/O^ and ^bow that y ^ cx + c^. 
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PROBLEMS 

Problems 1 and 2 are to be solved by the method of Example 1; Probs. 3 and 4 by 
that of Example 2. 

1. _ 2yp » 3y®. 

3. p^y -f 2. 

5. p^a* — py + I «* 0. 

7. p* 4* (2a: - y)p «« 2xy 

2. X « - p*. 

10. Show that Clairaut’s equation y ^ xp f(p) is a special case of Lagrange’s equa¬ 
tion (11-5), and thus obtain the solution. 

12, The Method of Substitution. Many first-order equations can be 
solved by a suitable change of variable. This has already been demon¬ 
strated in the substitution y ^ vx for the homogeneous equation (Sec. 7), 
in the substitution y — uv o( Sec, 10 , and in the use of p = dy/dx as 
independent variable in Sec. 11 . Further examples of the substitution 
method are given in this section. 

Thus, the Bernonlli equxiiion 

y' + P{x)y = Q{x)y^ (12-1) 

can be reduced to a linear equation by setting z = 

On dividing (12-1) by we get 

y-^y' + P{x)y’-^^^ - Q{x), 

But since = (1 ~ we can write this as 

(^ 1 -*)' + = Q{x), 

1—71 

On making the substitution z ~ we get the linear equation 

2 ' + (1 — n)P{x)z == (1 — n)Q{x), (12-2) 

which is solvable by the method of Sec. 10. 

The equation 

T - ^( rtrtr ) <■«> 

dx \hix “h 02 y "f* 03 / 

can be solved by the substitution x = u — y ^ v -- k if the constants 
h, k are chosen so as to make the resulting equation homogeneous. This 
procedure, which is simply a translation of axes, is illustrated in Example 2 . 

Because of the habitual use of the notation dy/dx, which implies that 
p is a dependent variable, one may fail to recognize that an equation is 
solvable if the roles of x and y are interchanged. For example, an equation 
which is nonlinear in y may become linear if x is regarded as the unknown 
and y is regarded as the independent variable. If an equation seems in- 


2. p2 + 1 « 2p. 

4. p’ -f 2p « c*'. 

6. p® 4-1/^ * 1. 

8. p* 4- (x — c*)p « xe*. 
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tractable as it stands, it is often helpful to interchange t and y, simplify, 
and attempt to solve the new equation. I'hen interchange x and y in 
the sohition of this to obtain the solution to the original equation. The 
procedure, which is illustrated in Example 3, amounts simply to the change 
of variable r « f/, ?/ = .t. 

Rrample 1. Solve the* e<^uatioii i/' 4- 2/ ** 

This is a special case of Bernoulli’s equation. Set 2 =» y~’^ to obtain 2' — » -2x 

by direct calculation or by (12-2). The general solution is 2 4-x -f so that 

-fa; *f is the solution of the original equation, 

JSmtnpk 2. Solve 

dy X — j/ — 2 
dx X 4 2/ -f 6 


by means of the substitution x — u — y ^ v — k, where /?, k are suitably chosen 
Constanta. 

Substituting gives 

du u 4 V (A 4 ^ b) 


If h and k are so determined that 


h — A: 4 2 =» 0, 
/, -f fc - 6 » 0, 


(12-5) 


then (12-4) becomes the homogeneous equation 


dv w — w 
du w 4 » 

whose solution is 


— 2ttv — ^ Cl 

by Sec. 7. Equations (12-5) give A « 2, A: « 4, so that u « 
A; « 1/ 4 4. Substitution in (12-6) leads to the final answer 


X 4 ^ 


{\2Ai) 
X 4 2, i; « t/ 4 


x^ — 2xy — — 4x — 12y «» c 

after simplification. 

Example Z, Solve (x — tf) dy ^ y dx. 

Interchanging x and y gives {y — x*) dx »» xdy or y' — y/x » —x“. This equation 
is linear in y and gives 2^/ « cx — by the method of Sec. 10. Hence the solution of 
the original equation is 2x =» cy — 

Example 4. Show how to solve the equation y' « P(ax + by 4 c), where a, 6, c 
are constant. 

l^et 2 «s ox 4 hy 4 c, so that 2' »* a 4 by\ Combining this wdth the original equa¬ 
tion gives z' -- a ^ by^ bP{ax 4 5]/ 4 c) « bPiz), or » a 4 bP{z). This equation 
is separable. The procedure fails if 6 •» 0, but then the original equation is separable. 


PROBLEMS 

Solve the following special oases of Bernoulli’s equation; 

- fldy , / 

1 , tr-j- — ■» sin x; 
ax X 


8- »' + »•• *»*: 
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3 

xy' 4- y 


^dx zy^ * 
y^ log x; 


4. y' - 4 x^V “ 9; 

3. I/' 4 xy » a;®!/*. 


Reduce the following equations to a form which is homogeneous or has separable 
variables, but do not solve; 


7 . V' 
9. y' 


X 4 y - 1 ^ 
2x 4 y 4" 2 * 
X y 4 2 


8 . y'> 

10. y' 


3 x 4 y 4 6 
3x4y 47* 

cos (x 4 y). 


X 4 y 4 3 

Solve by interchanging x and y and using an appropriate method on the result: 
dx 


11. « j/x* — x; 

dy 


12. y cb « (x 4 y®) dy; 
13. y 4 xy' » e^y'; 14. 1 4 xy' tan y « y'. 

Solve the following review problems by any method: 


16. y(l 4 x^) ^ dx 4 tan ^ z dy « 0; 


^ dx z ^ , 

17. ~ 4 - 4 y-' 
dy V 


0 ; 


19. e^y' « c* 4 e^; 

21. dx 4 2x dy * y dy; 

23. dy « (2y 4 dx; 

26. (x - y 4 1) dx 4 (x 4 y — 1) dy « 0. 


16. (1 4 x“) dy « (1 4 y*) dx; 

18. sin 2y dx 4 2x cos 2y dy ■« 0; 

20 . dx » (yx^ — x) dy; 

22. (x^ 4 y2) dx « xy dy; 

24. y2 » (xy - xV)y'; 


13. Reduction of Order. With y' = p, the transformations 



dp 

dx 


(13-1) 


dp dp dy dp 

dx dy dx dy ^ 


(13^2) 


often enable us to reduce an equation of second order in y to one of first 
order in p. For example, the equations 

F{x,y\y") = 0, (13^) 

F(y,y',2/") - 0 (13^) 


become by (13-1) and (13-2), resi)ectively, 



= 0, 

(13-5) 


1 == 0. 

(13-6) 


These are first-order equations in p, and when p has been found, the sub¬ 
stitution p ^ y* yields a first-order equation for y. 
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Example 1. Find the solution of y^'amy* -•sina' which satisfies the conditions 
y(l) wr 2 and y'(l) *» 1. 

Being free of y the equation has the form (13-3), and it l)ecomes 

dp . 

sm p ** sin z 
dx 

by (18^1), Solving this separable equation yields 

— cos p » — cos ^ 4" r, 

which is reduced to p «* x by the condition p » 1 at x =* 1. Writing dy/dx for p in the 
equation p « x gives on integration the final answer 

y « - 1) + 2 


in view of the condition ?/(l) *= 2 

Example 2, Solve yy’* — 2{y')'^ -f y^ - 0. 

This equation has the form (KM), since it does not contain x. The transformation 
(13-2) gives 

yp -r - 2//- + y^ = 0, 

<iy 

which is a homogeneous equation with y as independent variable. Setting p » vy and 
proceeding as in Sec. 7 give, after calculation, 

p = ±vV"l'+’iV. (13-7) 

With p ** dy/dx in (13-7) we separate variables to obtain the final answers, 

X 4” Cl « ^ smh"'’ — • 

cy 


PROBLEMS 

Problems 1 and 2 are to be solved by the methwJ of Examjilp 1, IVobs 3 and 4 by that 
of Example 2, and Probs. 5 to 7 by whichever method is more suitable. 

1 . (I - x’^)y" * xy'. 2. 4- -= y'. 

8. p" + e*' « 0. 4. p" yy\ 

8. « 1 - X. 6. pp^' « y'\ 

7. xp'' •« 4x - 2p'. 

8 . Solve « 1 4- by both methods of this section, and verify the agreement of 
the results. 


GEOMETRY AND THE FIRST-ORDER EQUATION 

14, Orthagonal Trajectories. In a variety of practical investigations, 
it is desirable to determine the equation of a family of curves that intersect 
the curves of a given family at right angles. For example, it is known that 
the lines of equal potential, due to a distribution of steady current flowing 
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in a homogeneous conducting medium, intersect the lines of current flow 
at right angles. Again, the streamlines of a steady flow of liquid intersect 
the lines of equal velocity potential (see Chap. 7, Sec. 19) at right angles. 
Let the equation of the given family of curves be 

/(a;,y,c) « 0, (14-1) 


where c is an arbitrary parameter. By specifying the values of the param¬ 
eter c, one obtains a family of curves (see solid curves in Fig. 6). Let 
it be required to determine the equa¬ 
tion of a family of curves orthogonal 
to the family defined by (14-1). 

The differential equation of the 
family of curves (14-1) can l>e obtained 
by eliminating the parameter c from 
(14-1) and its derivative, 


^ ^ 0 
dx dy dx 


(14-2) 


Let the resulting differential equation be 



Now, by definition, the orthogonal family of curves cuts the curves of 
the given family (11-1) at right angles. Hence, the slope at any point 
of a curve of the orthogonal family is the negative reciprocal of the slope 
of the curves of the given family. Thus, the differential equation of the 
desired family of curves is 


F 



This is a differential equation of the first order, and its general solution 
has the form 

ii){x,y,c) = 0. (14-3) 

The family of curves defined by (14-3) is the desired family of cun^es 
orthogonal to the curves of the given family (14-1). It is called the family 
of orthogonal trajectories. 

If the equation of a family of curves is given in polar coordinates as 
0 , the tangent of the angle a made by the radius vector and the 
tangent line at any point {rfi) of a curve of the family is equal to r dB/dr 
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(Fig. 7). Hence, by ilm preceding discussion, the differential equation 
of the orthogonal trajectories {>f the given family of curves is obtained by 
replacing r dO/dr by — dr/(r d$) in the differential equation of the given 
family of curves. 



Exa/tnpk Lei it be lequirod to find 1h<* family of <‘urves orthogonal to the family of 
ciioles (tig. 8; 


\r — cjr *= 0 


(14-4) 


The (lifftTential ef|uation of the fiimily f]4--f) can obtained by dilTerentiating (M-4) 

with lespect to x and eliminating the parameter 
c betw <*(‘11 (H-t) and the equation that revsulib 
from the <lifr(aentiation 

The re‘id('r w ill eheck Unit the differential equa¬ 
tion of the family (13-4) i.s 



2x1/ / 4 
fix 


y 


0 . 


Hence, the differential equation of the family of 
curvei. orthogonal to (H-t) is 

dx 


2xy 


dy 


x^ 4 r “ 0. 


This ib a homogeneous differential equation whose 
solution is found to l>e 

X® 4" 2 /^ — cy •« 0. 


Thus, the desired family of curves is a family of circles with centers on the y axis (see 
Fig. 8). 
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PROBLEMS 


Sket-cb the following families of curves, find the orthogonal trajectories, and add them 
to your sketch: 


1 . 4* y* « 

S. y « cx”; 

5. f «■ c; 

7. r « c(l — cos^); 


2 . xy 

x‘ 




4 4.1. « c*- 

4^9 ' 


6. r 
8. r 




1 ^ c cos d 

9. If a and h are constant and X a parameter, show that the family of curves 

a* + X ^ 6» + X 

satisfies an equation, free of X, which is unaltered when y' is replaced by —I/j/'.n Wlmt 
dcH's this indicate concH'rniiig the orthogonal trajectories? 

10. Find the algebraic eciuation, the differential equation, and the orthogonal trajec¬ 
tories for the fainilv of circles tfingent to the y axis at the origin. Verify your result by 
plane geometiy. (The cxjnfiguration is a special case of so-called bipolar coordinxUes.) 

16. Parabolic Mirror. Pursuit Curves. Besides the problem of finding 
orthogonal trajectories, many other questions in geometry lead to first- 
order diff(Teiitiul equations. The following examples show bow geo¬ 
metrical conditions of this sort stem from physical conditions. Ine first 
is taken from optics, the second from 
the theor}' of pursuit. 

Example 1. Find a mirror such that 
light from a point source at the origin 0 is 
reflected in* a beam parallel to the x axis. 

Let the ray of light OP strike the mirror 
at P and be reflect<*d along PR (Fig. 9). 

If PQ is the tangent at P and a, and 
0 are the angles indicated, we have a » 
by the optical law of reflection and a 
by geometry. Hence <A- The equa¬ 
tion 



2 tan (t> 

1 — tan^ 4 > 


gives 


tan 0 ** tan (/S + 0) « tan 24> 

y __2j/_ 

X i - (!/')*' 

since y* « tan Solution of this quadratic equation for y* gives 

, —x ±. Vx* + 3/* 


y 

xdx -i-ydy 

dbVx* -b y® 


dx. 


whence 
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The left-hand member of tbi« is an exact differential, and we get, on integirating, 

rfcVx* + •« X -f c, 

which, on squaring, yields « 2cx + c*. The curves form a family of parabolas with 
focus at the origin. 

Bxample 2. A boat A moves along the y axis with constant speed o. Find the path of 
a second boat B which moves in the left-hand lialf of the xy plane with constant speed 
h and always points directly at A . 


At a time i min after A is at (0,0), we shall have A at (0,a/) and B at say. Since 
the line AB is tangent to the path of the slope of this line equals the slope of the path, 
so that 


y at dy 

X — 0 dx 



or xy' — y ^ —at. 

To eliminate 1, we first differentiate (15-1) and obtain 


(15-1) 

{xy' - y)' « xy'' «« 

dx 


(16-2) 

Since di/di ■■ 6, where a is an arc on the trajectory, we have 


di di da 1 >- 

dx dsdx h 


(15-3) 

With r defined as a/6, substituting (15-3) in (15-2) yields 



xy" - -rVl + y>\ r - 

0 


(15-4) 

which is reduced to a separable equation of first order by letting p ^ y' 
The solution is 

v'-p-smh(rlo«-) --[(-)-(-)] 

as in Sec. 13. 

(16-5) 


and from this, y is found by integration. 


PROBLEMS 

Find the curves in the xy plane which satisfy the following conditions: 

1 . (a) The tangents pass through the origin; (fc) the normals pass through the origin. 

2 . (a) The segment of tangent between a point on the curve and the z axis has unit 
length; (6) the projection on the x axis of this segment has unit length. 

8 . (a) The area bounded by the curve, the x axis, and the ordinate equals the ordinate; 
(6) the area equals the length of the curve from (0,1) to (x,y), 

4. Find the path of a small boat in a wide river with uniform current if the boat has 
constant speed relative to the water and always heads toward a fixed point on the bank. 

6 . Solve Example 2 completely under the assumption that A is at (0,0) and B is at 
(xo,0), at time ( » 0. Distinguish the cases r » 1 and r 1. If r < 1, at what point 
and when does B overtake A? If r « 1, bow close can B get to A? 

16* Singular Solutions. It was remarked in Sec, 1 that a differential 
equation may possess singular solutions, that is, solutions which cannot 
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be obtained from the general solution by specifying the arbitrary constants. 
For investigation of this phenomenon let the family of integral curves 
defined by 

0 (16-1) 

be the general solution of the first-order equation 

« 0. (16-2) 

Assume that the family of curves (16-1) possesses an envelope^ that is, a 
fixed curve C such that every member of the family is tangent to C and 
such that C is tangent, at each of its points, to some member of the family. 
At a point (x^y) on the envelope, the values y, y' for the envelope are 
the same as for the integral curve, and hence these values x, y, y' satisfy 
(1G~2). Thus an envelope of a family of soliUions is again a solution. 

In general, the envelope is not a curve belonging to the family of curves 
defined by (16-1), and hence its equation cannot be obtained from (16-1) 
by specifying the value of the arbitrary constant c. It is known from cal¬ 
culus that the equation of the envelope is obtained by eliminating the 
parameter c between the equations 

4>{x,y,c) « 0 and 4*c{x,y,c) = 0, 

where ^ d4>/dc. 


Example: The family of integral curves associated with the equation 


is the family of circles 


y\y')^ -f y* * a* 
(x - c)* + V* « a*. 


(16-3) 

(16-4) 


The equation of the envelope of the family (16-4) is obtained by eliminating c betw^een 
(16-4) and 0c * —2(x — c) « 0. There results 


y « zta, (16^) 


which represents the equation of a pair of 
lines tangent to the family of circles (16-4) 
(Fig. 10). Obviously, (16-6) is a singular 
solution of (16-3), for it is a solution, and 
it cannot be obtained from (16-4) by any 
choice of the constant c. On referring to 
Sec. 1, it is easy to check that the condi¬ 
tions of the theorem ensuring uniqueness 
of the solution are violated in this example. 



PROBLEMS 

1 . (o) Show that j/ — c *• (x — c)* represents a family of congruent parabolas with 
vertex on the line y *■ x, and sketch. (6) By differentiating with respect to c obtain 
the envelope y » x ^ (c) By direct computation, verify that the parabolas and the 


m 


OEDiNABY mFPEEENTIAI. EQUAYIOKe 


[CHAF. 1 

envelope have the saiiie slope at corresponding points, (d) Obtain a first-order differen¬ 
tial equation for the family, and (e) verify tlxat y ^ ^ is a singular solution of this 

equation. 

A particle on the z axis has velocity v » whore « is the distance to the origin. 
Show that the motion is uniquely determined if the particle is at any point other than 
the origin but that infinitely many different behaviors can occur if the particle ever 
reaches the origin. 

S. (o) Obtain the equation j/j/' + -f- a: 0 for the orthogonal trajectories of the 

family p «• ex + 1/r. (b) Show that p * 2\/z is the envelope of the family, (c) At 
points of the curve y 2\/x find the slope of the solutions of yy' -f (y')^ + x « 0 in 
terms of x. Then find the slope of the curve y » 2V^ in terms of x How are these 
two slopes related? Why? (d) Sketch the family, the envelope, and the orthogonal 
trajectories in a single diagram. 

17. The General Behavior of Solutions. The foregoing paragraphs in¬ 
dicate* that from suitable geometric conditions on a curve, one can obtain 
a differential equation for the curv'e. Now, in this section the point of 
view is to be reversed. Starting from the differential equation we obtain 
certain geometric conditions, which enable us to describe the solution 
qualitatively even when the equation itself cannot be solved. 

The function f(x^y) in the general first-order equation 

dy 

dx 

gives the slope of the solution curve at each point (.r,y). Hence the solution 
curves are increasing functions of x in regions of tlio xy plane in which 
is positive and decreasing in regions where f{x,y) is negative. For 

continuous/(a:, 2 /) the boundarj^ between these regions is part or all of the 
curve 

f{x,y) = 0. (17-2) 

Equation (17-2) gives the locus of the critical points, and their character 
(maximum, minimum, neither) is shown by the sign of f{x,xj) at neighbor¬ 
ing points. The inflection pointvS and sense of concavity are similarly 
found from 

-/x+A/ -/x+U (17-3) 

where /jc s df/dx and fy ^ df/dy. 

For more detailed information one can plot the curves 

=» c, (17-4) 

called isoclines. At any point (x^y) where (17-4) holds, the solution curve 
approximates a straight-line segment of slope c, a fact which can be used 
as a check on the qualitative information obtained from (17-2) and (17-3). 
From this viewpoint (17-1) is equivalent to a direction field in the xy 
plane as discussed in Sec. 1. Any curve whose tangent at each point has 
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the direction of the field is a solution, and conversely. The isoclines 
(17-4) and the direction field discussed in Sec. 1 lie at the basis of some 
methods for numerical solution of differential equations. 



A techni<]ue of obtaining approximate solutions, based on a comparison 
idea, was developed by S. A. C/haplygin for equations of first and higher 
orders (see Fig. ll). l^et ?/i(r), y(x), and 1 / 2 ( 2 ’) be .solutions of 

dvi dy dvo 

-f ~^f{x,y), ~=/ 2 (x,j/ 2 ). (17-5) 

dx dx dx 

By subtraction, the difference 2 / — //i satisfies 

d 

7- (2/ ~ 2/1) = /(2-,2/) - y =* y{^)^ yi yiW- (i 7 - 6 ) 

dx 


Now, if /(.T,?y) >/i(a*,//i) in a range of r, then y — ?/.* i« an itwreadn^ 
function of x in that range. In this case the condition y — y\ == 0 at some 
point To ensures that y — 2/1 > 0 for x > xq and y — y\ < 0 for a; < xq. 
Similar remarks apply to //2 — y. Hence the conditions 

fdx,v) > Kx,y) > h{x,y), 

(17-7) 

2/1 (xo) = 2/(xo) = J/ 2 (Xo), 


in (17-5) enable us to conclude that 

2/1 (x) > 2/(x) > y2(x), 

2/i(x) < 2/(x) < Viix), 

One chooses/i(a;,y) and h{x,y) in such a way that the solutions yi, are 
obtainable by elementary methods. Equation (17-8) then gives an explicit 
estimate for y(x). 


X > Xo, 
X < Xq. 


(17-8) 


A refinement of these ideas leads to an explicit and important inequality for estimat¬ 
ing the error in certain approximations. Let y(x) be an exact solution of y' — /(*,y) 
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through the point (xin^ya), and let yi(x) be an approximate solution through this point. 
Substituting yi(x) into the equation gives 




(17-9) 


where the error term c(x) arises because yi is not an exact solution. Now, what can be 
said about the solution error |i/i ~ y| in terms of the substitution error e(x)? 

To answer this question we suppose that f{x,y) is continuous in a region containing 
(^OfPo) and satisfies a so-called Lipsrhitz condition there; that is, 

|/(x, 2 /i) -f(x,y)\ < k\yi - y\, k const, (17-10) 

for some k and all x, y, and yi in the region. The condition (17-10) stipulates that 
/(x,y) shall not change too rapidly when y changes. In case /„ exists, the mean-value 
theorem gives 

fi^.y) - fix,yi) « fyix,&iv - i/i), y <k <yu (17-11) 

and hence (17-10) holds in any region throughout which 

\fyix,y)\ <k. (17-12) 

Returning to the original question, in (17-9) let E{x) l>e the error in yi, 

E(x) - yxix) - y(x), (17-13) 

Since y{x) is an exact solution, we have y' f(x,y), and hen(*e, subtracting from (17-9), 
dvi dy dE 

™ - - 7 ” “ (^ 7 - 14 ) 

ax dx dx 


If (17-10) holds, and if | c(x) | < m, then (17-14) leads to 


dx 


< \f(x,yi) - fix,y) I -h I c(x) | 


Eix) I -h m. (17-15) 


If we could drop the absolute values in (17-15) and replace the < by 
tain the linear equation 

~ - kEix) -f m. 
dx 

The solution with i?(a?o) « 0 is 


E(.x) - ^ (?*<*-*.> - 1). 
k 


we should ob- 
(17-16) 

(17-17) 


Now, it is plausible and can be proved rigorously that Eix) in (17-17) is the maximum 
possible Eix) subject to (17-15), with x > xo. Hence the solution error E ^ yi — y 
satisfies 


|»i(x) -»(*)! £^(«*|*-**I - 1), 


{ m « max | e(x) |, 
k •* Lipschitz constant, 


(17-18) 


where lx — xo| is used rather than (x — aso) to account for the case x < xo. 

Equation (17-18) leads at once to a uniqueness theorem, for if yiix) is an exact solu¬ 
tion, then e(x) ® 0 in (17-9), hence w » 0 in (17-18), and therefore yiix) « j/(x). 

Example 1. Discuss the integral curves for the equation y' ■« xy — 1 without solving 
the equation. 
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The hyperbola xy «* 1 is tlie locus where y' *» 0. If > 1, then y' > 0 and the 
solution curves are increasing, but if ary < 1, they are decreasing. Hence, ary » 1 gives 
a locus of minima in the first quadrant, maxima in the third quadrant. Since y' *> — 1 
when ar « 0 or y 0, all integral curves intersect the axes at an angle of 135°. From 

y” « ary' -h y “ x{xy - 1 ) -f y « y{x^ -f- 1) _ 

the curve is concave up if y > x/(x® 1) and concave down when this inequality is 

reversed. The curves have the appearance shown in Fig. 12. 



Fio. 12 


Example 2. If y' » sin xy, y(0) «» 1, show' that 

<y< 

at least for 0 < x < 0.8. 

A glance at the graph of sin u shows that 

2 

- tt < sin u < M 

X 


for 0 < tt < x/2, and hence 


2 
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Tlte sokition of the given equatio therefore lies between the solutions to 




xy. 


This gives the dejsired inequality for the range 0 < xy < t/2. Since y < it suffices 
to have 

0 < < ir/2, 

and this is true for 0 < x < 0.8. 


PROBLEli^^ 

1. For the equation y' » y/x ~ 1 , (a) sketchlthe locus y' «= 0 in the xy plane. (6) 
Indicate the regions in vhich y is increasing; decreasing, (c) When is y concave up? 
Down? (d) At what slope do the solutions cross the axes? (c) Sketch the locus where 
the solutions have slope 1, —1,2, —2, 5, —5. (/) Sketch the solutions as well as you 
can. {g) Verify your work by solving the equation. 

2. In what regions of the xy plane are the solutions of y' »= sin (x^ + ,V®) increasing? 
Decreasing? Sketch. 

3. Discuss the equation y* ~ sin (x^ -f y), y(0) 2, by comparing with suitably chosen 

simpler equations. 


AFPIICATIONS OF FIRST-ORDER EQUATIONS 

18, The Hanging Chain, Let it he required to find the curve assumed 
by a flexible chain in equilibrium under gravity (P'lg. 13). With « as arc 



from the point t = 0, let the weight density of the chain lie w{s) lb per ft 
and let the loading function be fix) lb per ft. The equation of the curve 
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y =« y{x) will be obtained from the fact that the portion of chain between 
0 and X is in static equilibrium. 

Equating horizontal forces gives 

To « r cos e, (18-1) 

where is the tension at the lowest point and $ the angle which the tan¬ 
gent to the curve makes with the horizontal. Similarly, equating vertical 
forces gives 

w{$) ds + fix) dx — T Bind (18-2) 

since the weight of the chain-plus-load must be balanced by the vertical 
component of T. Both (18-1) and (18-2) require that the function y(x) 
he differentiable (so that 8 is well defined) and they use the fact that the 
tension is tangential, for a flexible chain. 

From (18-1) w'e have ~ Tq/cobO, so that 

7' sin 8 — Tq tan 8 = 7 ojf/', (18-3) 

the latter equation resulting from the definition of y' as slope. Substitution 
in (18-2) gives 

f w(s) ds + f fix) dx = Toy'. (18-4) 

JO JO 

When w and / are continuous, (IS-^l) may be differentiated with respect 
to X, a procedure which leads to the dilTerential equation for the curve 


in view of the fact that 


ds 

wis) -J- + fix) 
dx 


Toi/‘ 


d 

— / K’(s) ds 
dx Jo 



wis) ds 


ds 

dx 


(18-5) 


Example Show that a uniform chain acied upon by ^gravitational forces alone 

assumes the sluipe of a cnleiiury, and find the tension in terms of the height y. 

Here/(x) « 0, wis) ** wo, a constant. Since ds/dx « Vi -f Eq. (18-5) gives 

u-uVr+V^ = ny". (18-6) 


This is a second-order eriuation, which can be reduced to one ot first order by the method 
of Sec. 13. With p « dy/dx we have 


and hence (18-6) becomes 


dp dp dy 

— 33t-- 

dx dy dx 


cVl d- P* 




(1^7) 




4 
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Hub equation is Beparable, the solution being 
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ey - VTT? (IM) 

when the axis is so chosen that ay ^ \ when p » 0. Equations (18-1) and (18-8) now 
give T ^ Tome e •• ToV 1 -f tan® B » To^l 4 - « Toq/ « mVt which gives the 

teosiozi. Writing p *■ dy/dx in (18-8) and separating variables yield 


as the reader will verify. 


To , 

— cosh — 
To 


PROBLEMS 


1. A flexible weightless cable supports a uniform roadway weighing m lb per ft. 
The tensions at the highest and lowest points are 7’ and To, the roadway is 2a ft long, the 
sag is b, and the length of the cable is 2s. If the cable is symmetric about the y axis 
and has ita lowest point at the origin, show that the equation of the curve is 

tuox® 

2To 

and thus obtain the relations To « w^jTb, T *« woia/b)\/a^/A -f h*, 

a « r \/l + (26a;/a®)® dx, s — o 26®/3a 
Jo 

as 5 0. Hint: 1 + u — 1 u/2 as u —► 0. 

2. One end of a flexible uniform telephone wire is 6 ft above the lowest point, at a 
distance a ft from it measured horizontally and at a distance s ft from it along the wire. 
If M =» awo/To, in the notfition of Sec. 18, show that u satisfies the transcendental equa¬ 
tions (cosh M — l)/u *» 6 / 0 , (sinhu)/^ *» «/o and hence by division the nontrans- 
oendental vQUation tanh («/2) » 6/». Also find the relations To ^ woa/u « wos each u, 
T »■ tcoa (cosh w)/a * wcoth u for Tq and for T, the tension at the highest point. 
The student familiar with infinite series will obtain .simplified expressions by expansion 
of the hyperbolic functions when u is small, that is, when the tension is large. 

19. Newton’s Law of Motion. Newton’s second law of motion states that 
the time rate of change of momentum is equal to the impressed force. In 
symbols, 

- (mv) == F, (19-1) 

at 

where F = component of force in the direction of motion 
m « mass of the moving particle 
V «= ds/di = velocity of the moving particle 
It is supposed that the particle moves in a straight line, its distance from 
some fixed point on tliat line being a. 

The differential equation (19-1) is quite general, since the force may 
depend on the time f, on the displacement s, and, in the case of damped 
motion, on the velocity v. Also the mass may be variable in some problems, 
for example, those concerned with rocket flight or with high-speed electrons. 



BBC. 19] 
Since 


APPUCATIONS OF PIRST-ORDBR EQUATIONS 
d(mv) d{mv) ds d{mv) 

- is*-- - P 

dt ds dt ds 


Eq. (19-1) may be put in the form 

d{Tnv) 

V *- 

ds 
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(19-2) 


( 19 ^) 


which gives an alternative statement of Newton’s law. Multiplying 
(19-3) by m leads to {mv)d{mv)/ds = Fm, which may be written 

- ~ (mv)^ = Fm. (19-4) 

This gives still another formulation. 

In case m and F are knowm in terms of s only, m « w(«), F « F(«), then 
(19-4) may be solved completely: 

Hiinvf — y^irriovof = f F{s)m{s) ds. (19-5) 


If the mass is constant, (19-5) becomes 


— Hmuo « / F(s) ds, (19-6) 

Jh 

since m(s) may be factored out of the integral. Then (19-6) is the law of 
conservation of energy^ for the left side of (19-6) is the change in kinetic 
energy while the right side represents the work done w'hen the particle 
moves from So to s. Thus the right side is the change in potential energy. 
The steps leading to (19-6) are evidently reversible if F{s) is continuous, 
and hence Ne wton's law is equivalent to the principle of conservation of energy^ 
when the rnass is constant and the force ts a continuous fumtion of position 
only. 

When F and m are known functions of a, it has been seen that one can 
obtain a so-called first integral of the equations. If F is a known function 
of the same is true; we have 

mv — moVo == f F{t) dt, (19-7) 

Jto 

by inspection of (19-1). And similarly, when F and m are known functions 
of V, one can write dimv) = w(y) dv -f v?n'{v) dv. Substitution in (19-1) and 
separation of variables now give 


m{v) + vm!(v) 



[chap. 1 


44 OIU>mAEY DIITFEKHNTUI. EQUATIONS 

Tlie same process in (19-3) yields 


s 



m(v) + vm\v) 


vdv. 


For several particles addition gives 



XFi 


(19-9) 


(19-10) 


With M as the total mass M Sm, and with V as the mean velocity, MV ^ 'ZnhVt, 
this may be written {d/dt){MV) « F. Here F <« SF* is the total foree; but since the 
internal forces cancel in pairs, by Newton's law of equal and opposite rt=‘action, F is also 
the U>tal external force acting on the system. The exterision to continuous mass dis¬ 
tributions is made by analogy, the equations being defined as the limiting form of those 
for a set of approximating discrete distributions. Thus any point moving with the mean 
velocity V satisfies Newton's law in the form (19-1). It can be shown that this point 
actually remains *'inside” the body if the v, are suitably restricted, but some restriction 
is necessary. Of course, if the masses are constant, then V =* dS/dt, where iS' is the posi¬ 
tion of the center of mass, MS « 2m,s» In that case the center of mass itself follows 
(Ift-I). 

Example 1. The force on a particle of mass m is proportional to its distance from the 
origin and is directed toward the origin. Find a differential equation for its motion. 

The force is ks if s is the distance from the origin at time i Since the force is directed 
toward the origin, it has at all times a sign opposite to that of a. Thus k is negative, 
and one may write k »» — for some constant oj. Eijuation (19-1) will now give 
d{im)/dt w — or, dividing by m and putting e «« d«/dt, 

~ + «''s-0. (19-11) 


This is the equation for simple harmonic motion, an important type of periodic motion 
that arises in many mechanical and electrical systems. The general solution of (19-11) is 

a • .4 cos {id 4- B) (19-12) 


as shown in Probs. 2 and 3 and in the Example of Bee, 21. Hence the motion is t>eriodic, 
with period 2ir/&> independent of the amplitude A and pluise B, 

Example 2, A gun containing a bullet moves with nonuegative velocity v on a straight, 
horir/ontal, friotionless track and points in a direction exactly opposite to that of the 
motion. The mass of bullet-pius-gun is m, and that of the bullet is —Aw, where Am Is 
negative. If the bullet is fired with velocity r relative to the gun, show that {v — r) Am 
equals the momentum of the gun after firing minus the momentum of the bullet-plus- 
gun before firing. 

By (19-1) the momentum of the Imllet-plus-gun is constant, since there is (we as¬ 
sume) no external force on this system a.s a vrhole. Hence 


mt> ■» (m -f Am)(f> -f Ai/) 4* ( —Am)vj,, (19-13) 

where s + Ap is the new velocity of the gun and i>& of the bullet: 

«& « - c. (19-14) 

Computing (m 4- Am)(e 4- Ap) — mi/ from (19-13) and (19-14) gives the result. 
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Two remarks are in order. Firat, the ^equation of continuity for momentum,^* Fq. 
(19-13), has been seen to follow from Newton's law; it is not a new assumption. Second, 
if one replaces (19-14) by 

f/t « i; -f c (19-16) 


(which is a justifiable alternative), the result is altered only by the second-order term 
Av Am. Hence there is no change at all when the incremente are replaced by difhnrentiais 
as in the following example. 

Example 3. A rocket fires some of its mass backward at a constant rate r kg per sec 
and at a constant speed c m per sec relative to the rocket. Show that the thrust devel¬ 
oped is r(c v) when the velocity of the rocket is v. If the rocket starte with velocity 
Vo and there is no other force acting, t» vo *4- c log 2 when half the mass is used up. 

With m and v the mass and velocity of the rocket at time <, the differential in momen¬ 
tum d{mv) due to external forces is F df by (19-1). That due to loss of a mass —dm at 
speed c — r in the backward direction is d(mv) *• (c — r)( —dm) by Example 2; for 
diJforentials, not increment®, the result is exact. Thus is obtained a fundamental rela¬ 
tion for rocket problems: 

d(mv) « Fd/ — (r — v) dm. (19-16) 

In the present case dm « —r dt, .so that (19-10) gives 


d{mv) 

’"HT 


F *f r(c “* c) 


(19-17) 


after division by dt. Hence the efed of the rocket motor is to add r(c v) to the force F, 
and that is what was to be shown. 

Substituting 

m « mo — r< (19-18) 


for m in (19-17) gives (mo — rt)(dv/dt) « rc -f* F after slight simplification. Hence, by 
separating variables, 

V — «= ( r ) log — ^ for constant F. (19-19) 

\ r / m 


Putting F « 0, m « mo/2 gives the second result. 

Example 4. Starting with velocity vo an electron is accelerated for a distance s by a 
constant electric field of magnitude E. What is the terminal velocity? 

Let c be the velocity of light, so tha'. the mass m of the electron is given in terms of 
its rest mass mo and its velocity v by 

« “ (19-20) 

V 1 — tr/n 

If we write 


■ sin $, 


i-l 


cobO 


(19-21) 


as we may for v < c, then (19-20) gives rn ^ mo sec 6 and mv « emo tan B. Substituting 
in (19-3) with F «* Ee, where c is the charge on the electron, gives 

F eE 


ds 


(rmo tan 6) 


■ CSC B. 


Hence sec* &{de/ds) » {eE/nw?) c.sc and by integration 


seE 

sec • sec ^0 H-? 

mod 


(19-22) 
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where flo refers to the ioitial value. For numerical calculation it is more efficient to use 
the form (19~22) with trigonometric tables than to obtain v explicitly by (IMl). 


PROBLEMS 


1* A brick is set moving in a straight tine over ice with an initial velocity of 20 fps. 
If the coefficient of friction between the brick and the ice is 0.2, how long will it be before 
the brick stops? 

2. Find a value of a for which s » ci sin (o^ -f c»j) is a solution of (19-11). Does this 
expression have enough independent constants to be a general solution? Determine 
Ci and ct in such a way that the displacement s is maximum at i « 0 and has then the 
value A, Determine ci and cz in such a way that the maximum displacement is A and 
the maximum velocity occurs at 1 » 0. 

8. Apply the transformations used in the derivation of (19-6) to obtain the appropri¬ 
ate form of (19-6) from (19-11). Check by direct comparison with (19-6). Solve the 
resulting equation by separating variables, and thus show that the solution obtained in 
Prob. 2 includes every solution. 

4. Suppose the rocket in Example 3 is subject to a retarding force of magnitude 
mg + kVf whfjre g and k are constant. From (19-17) and (19-18) obtain a linear equation 
for t) as a function of t. Show that (mo — is an integrating factor, solve for 

and obtain the position s at time t from s = Jvdt. 

6. The equation of a cycloid Is x ^ -h sin y « 1 — cos 0, Show that the arc s 
from the lowest point satisfies «= 8y, and deduce the eejuation 4v‘^ j7(so — for 
a particle sliding down the curve. By differentiation obtain the equation 4.4^s/dt^ * 
—ys, which shows that the motion is simple harmonic. What is the period? 

6. In a microwave electron accelerator the field is E sin wt. If an electron starts with 
velocity vo, find the maximum poasible terminal velocity. Bmt: The maximum occurs 
when the time for passage is exactly ir/w, for an electron starting at time i «* 0. Use 
(19-8). 


20. Newton’s Law of Gravitation. Another law of Newton is the law of 
gravitation, to which he was led in his attempt to explain the motion of 
the planets. This law states that two bodies attract each other with a force 
proportional to the product of their masses and inversely proportional to 
the square of the distance between <,hem, the distance being large compared 
with the dimensions of the bodies. If the force of attraction is denoted 
by F, the masses of the two bodies by mi and m 2 , and the distance between 
them by r, then 


7mim2 


( 20 - 1 ) 


where 7 is a proportionality constant, called the gravitational constant. 
In the cgs system the value of 7 is 6.664 X 
It can be established that a uniform spherical shell attracts a particle 
at an external point as if the whole mass of the shell were collected at the 
center (see Chap. 5, Sec. 14). Hence, by integration, the same is true for 
a solid sphere provided the density is a function of the radius only. If the 
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sphere is the earth, one can therefore write 

toward the earth, (20-2) 

where =« earth’s mean radius 

r = distance from particle to center of earth 
Q = new constant called acceleration of gravity 
Its value in the cgs system is approximately 980 cm per sec per sec and 
in the fps system 32.2 ft per sec per sec. Since the earth is not a perfect 
sphere, and since the density varies from place to place, the value of g 
depends slightly on location. One uses a plus or a minus sign in (20-2) 
according as the positive direction is taken toward or away from the earth’s 
center. 

It can be shown that a uniform spherical shell exerts no force on a particle which is in 
the hollow space enclosed by the shell (Chap. 5, Sec. 14). Hence the force on a particle 
of mass m at distance r from the center of a sphere is 

j iiri/piu) du (20*3) 

when the density of the mat(*rial forming the sphere is a continuous function p(u) of the 
distance u to the center The special case p(u) » 0 for m > ro gives the result for a par¬ 
ticle outside the sphere, as disoassed previously. 

Equation (20-3) gives 

F ^ mg ~ (20-4) 

for a particle irLside the earth if p is taken as constant In case the particle is close to 
the surface, we have r ^ r*, so that either (20-2) or (20-4) takes the simple form 

F « mg. (20-5) 

The error in (20-5) is less than 1 per cent for heights up to about 20 miles. 

Example 1. Neglecting air resistance, discuss the velo<*ity of a particle falling toward 
the earth. 

The principle of conservation of energy combines with (20-2) to give 



or, after carrying out the integration, 

(2(W) 

If the particle starts from rest at a very great distance, the velocity with which it strikes 
the earth is 

Ve •“ V2^ (20-7) 

as we see by setting wo « 0, ro «* <», r « r« in (20-6). This terminal velocity is also the 
minimum velocity of escaix^ for a particle which leaves the earth never to return. Since 
is approximately 4,000 miles, we find from (20-7) that i>e is nearly 7 miles per sec. 
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Examfie 2. Obtain a differential equation relating the density and pressure in the 
interior of a spherical star if each is a function of the distance to the center only: /> « p(r), 
p «* p(r). Assume /)(r) continuous. 

Consider a column of material of unit cross section extending along a radius from 
r to ft, the radius of the star. The pressure at the base of this column equals the total 
downward force on the column due to gravitation. The differential force on an element 
of the column from r ^ q io r ^ q dq is given by 


dF •= y j^4wv‘i)(u) du j E i,(q) dq, 


in accordance with (20-3), and the total force is given by integration: 


p(r) « ^ * ^{q) dq ^ - f <^(q) dq. 

Jr Jr$ 

Since the right side of Eq. (20-8) has a continuous derivative, so does p(r): 

^ « -4,(r) = f 47ruVu)du. 

dr 7o 

Multiplying by r*/p(r) and differentiating again lead to 


(20-8) 




Example 3, Assuming conservation of energy for motion in a curved path, obtain an 
expression giving the period of a simple pendulum. 



Fig. 14 


Let P denote the position of a pendulum bob suspended from 0, and let d be the 
angle made by OP with the position of equilibrium OQ, as shown in Fig. 14. The work 
required to change B to any other value a is the work required to raise the bob through a 
vertical distance a cos 0 — a cos a, if a is the pendulum length. With a chosen as the 
angle for maximum displacement, so that *» 0 at d « a, conservation of energy gives 


1 

2 



-ma‘ 
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where m is the mass of the bob. Separating variables gives t in t^rms of $ and hence, 
implicitly, 8 in terms of U In particular the time required for 8 to increase from 0 to a is 


r 

4 




dB 


^9 Jo v cos — C50S a 


(20-9) 


so that the period T depends on the amplitude a. This dependence leads to the so-called 
circular error in pendulum clocks. 

The identities cos 8 1 — 2 sin® (^/2), cos « 1 — 2 sin® («/2) give 

la r d 8 a 

T « 2^J- / —. fc « SiU-* (20-10) 

Jo V k® - sm® (0/2) 2 

If a new variable of integration <t> is defined by 

sin ~ ^ sin (20-11) 

them <t> ranges from 0 to v/2 when $ ranges from 0 to a. Also by (20-11) 

2k cos 4>d4> 2\/P'--sin® (0/2) 

^ --- -r^-rr^- r=:r-=rr=-~ (HS. 

COS 8/2 \/l — k^ sin® <t> 

Substitution into (20-10) yields a so-called elhphc integral 



d(f> 

\/1 — /c® sin® (i> 


k 


oc 

-sin-. 


( 20 - 12 ) 


The advantage of (20-12) over (20-9) is that (20-12) has been extensively studied and is 
available in tatd(*s A series expansion is easily obtaintM.!, by expanding the radical 
for small a. The result is 




The function 


F(k,x) « f- 

Jq \ 


\/l — hiri® 0 

is called the elliptic integral of the first kind. See (^hap. 2, See. 10. 


PROBLEMS 

1, A stone is thrown vertically upward with velocity 8 fps at time < 0. Using 

(20-5), write an expression for the position and velocity at time i and also for the velocity 
as a function of distance s. Find the time at which the velocity is zero, and show that 
the height is then maximum. Show that the maximum height agrees with that obtained 
by equating kinetic and potential energy, that is, with mgh ** mvo/2. 

2 . A particle slides down an inclined plane, making an angle 6 with the horizontal. 
If the initial velocity is zero and friction may be neglected, the component of force in 
the direction of motion is F » mg sin 0, Wliat are the velocity of the particle and the 
distance traveled during the time f? Find the speed as a function of the vertical distance 
fallen, and verify that the same result would be given by equating energies as in Prob. 1. 
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At any given instant, show that the locua o£ the particles obtained for various Os is a 
circle (Fig. 15). 

3. Suppose the pressure in the atmosphere is a known function of the density, p » /(p). 
Show that the height h at which the density has first dropped to zero from the soa-level 

rpo 

value po satisfies ffh/(l 4- ^/r«) -• / [/'(p)/p]dp, when (20-2) is used. Hence the de¬ 
pendence of gravitation on distance introduces an effect which depends on h only, not 
on/(p), and the effect is less than 0.02k when h is less than 80 miles. Obtain an explicit 
expression for h in the case of adiabatic expansion, p « kp®. For what values of a does 
this give a finite height? (For air a *« 1.5; the height thus obtained turns out to be about 
18 miles.) 



4 . A man and a parachute weighing w lb fall from rest under the force of gravity. 
If the air resistancfi is proportional to the square of the speed v, and if the limiting speed 
is vof find the speed as a function of the time t and as a function of the distance fallen s 
UiiU' (w/g){dv/dt) « u) — kv^, 

5. A projectile is fired, with an initial velocity vo, at an angle a with the horizontal 
Find the equation of the path under the assumption that the force of gravity is the only 
force acting on the projectile. For w4at a is the range maximum? Describe the region 
which is within the range of the gun. Hmt: Find the envelope of the trajectories when 
vq is fixed but a varies. 

6. A cylindrical tumbler containing liquid is rotated with a constant angular velocity 
about the axis of the tumbler. Show that the surface of the liquid assumes the shape of 
a paraboloid of revolution. Hint: The resultant force acting on a particle of the liquid is 
directed normally to the surface. This resultant is compounded of the force of gravity 
and the centrifugal force, since pressure at the free boundary is zero. 

7 . Water is flowing out through a circular hole in the side of a cylindrical tank 2 ft 
in diameter. The velocity of the water in the jet has the value which it would attain 
by a free fall through a distance equal to the head. How long will it take the water to 
fall from a height of 25 ft to a height of 9 ft above the orifice if the stream of water is 
1 in. in diameter? 

8 . Water is flowing out from a 2-in. horizontal pipe running full. Find the discharge 
in cubic feet jier second if the jet of water strikes the ground 4 ft beyond the end of the 
pipe when the pipe is 2 ft above the ground. 
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UNSAK DIFFERENTIAL EQUATIONS 

21. Irinear Homogeneous Second^rder Equations. An equation of the 
form 

y'' + pi(^)y' + P2(^)2/ « 0, (21-1) 

in which pi(x) and ^ 2 ( 2 ^) ere specified continuous functions of x in a given 
interval {a,b), is called a linear homogeneous equation of second order. 
From the existence and uniqueness theorem of Sec. 1 it follows that this 
equation has a unique solution for every x = xq in (a,6), satisfying the 
initial conditions 3 /(xo) « yo> = 2/o* Thus, the integral curve for 

Eq. (21-1) is determined uniquely when the ordinate and the slope of the 
curve are specified at a given point of the interval. 

Equation (21-1) is called linear because its solutions satisfy the following 
linearity properties: 

1. If 1 / = 2 /i(x) is a solution of (21-1), then y « cyi{x)j where c is a 
constant, is also a solution. 

2. If ^ = t/i(x) and y = y 2 (^) are tw^o solutions of (21-1), then their 
sum y ~ 2/1 (x) + y 2 {x) is also a solution. 

It follows from these properties that the sum of any number of solutions 
of (21-1) each multiplied by a constant is also a solution. 

The proof that properties 1 and 2 hold is simple 

Thus, suppose that y « yi(x) is a solution of (21-1); then the substitu¬ 
tion in (21-1) gives an identity 

y'i + PiVi + VtVi ^ 0. (21-2) 

We must show that 

(^ 2 / 1 )" + ViicijY + ( 21 - 3 ) 

vanishes identically for every constant c. But since c can be taken outside 
the differentiation sign, we can write (21-3) as 

c{y[ + PiVi + p 2 yi)r 

and this vanishes because (21-2) does. 

This establishes property 1. 

To establish property 2, suppose that y » 2/1 (x) and y « 2 / 2 (x) are two 
solutions of (21-1). Then 

Vi + ViVi + V2yi ^ 0, 
yi + pm + P22/2 ^ 0 . 

We must show that 

(yi + F2)" + PiiVi + 2/2)' + p2{yi + 2/2) ^ 0 . 


(21-5) 
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iBasmuch as the derivative of the sum of two functions is the sum of the de¬ 
rivatives, we can rewrite the left-hand member in (21-5) as 

(^i + fxVi + PzVi) + {yl + ViVz + P22/2)# 


and this vanishes by (21-4). 

Let us suppose now that by some means we have obtained two solutions 
yi{x) and y^ix) of (21-1). Then by the foregoing 

y{x) « CiViix) H- C2^2(a^) (21-6) 


is a solution for every choice of the constants We say that (21-6) is 
a general solution of (21-1) provided that for a suitable choice of the con¬ 
stants Ci the solution satisfies arbitrarily specified initial conditions, 

2/(^0) = 2/0, y'{X(s) = iIq, (21-7) 

To determine the restrictions on y^ix) and \j 2 {x) envsuring that the solution 
(21-6) is, indeed, general, w'e insert (21-6) in (21-7) and obtain two linear 
algebraic equations, 

CxViix^) + C2l/2(^o) == 2/0, 

( 21 -o) 

C\yi{Xo) + C2?/2(j’o) = y[h 


for Cx and C 2 . The system (21-8) can be solved for Cj and Cj (for arbitrarily 
specified Xo, yo> and 2/0) if, and only if, the determinant 


W(y 1 , 2 / 2 ) ^ 


Vxix) y 2 {x) 
y'lix) y^ix) 


^ 0 


(21-9) 


for eveiy x = Xo in the interval. If W(2/i,2/2) == 0 for some value of x, 
the constants c, cannot bo detennined for every choice of ijo and y'o and the 
solution (21-0) is not general. The doterininant W(y 1 , 1 / 2 ) is called the 
Wronskian after tlie Polish mathematician G. Wronski, who deduced 
the criterion (21-9) for the generality of solution (21-6). 

The condition (21-9) is equivalent to the statement that the solutions 
yi{x) and y 2 (x) are linearly independent. We say that yi{x) and yzix) 
are linearly independent if the identity 

CiViix) + C2y2(x) s 0 (21-10) 

can be satisfied only by choosing Ci = C2 * 0, When nonzero constants 
Cl and C2 can be found such that ciyi{x) + C22/2(^) - 6, we say that yi{x) 
and y 2 {x) are linearly dependent. In other wordKS, linear independence of 
yi{x) and y 2 {x) means that the ratio y 2 {x)/yi{x) is not a constant. But if 
this ratio is not a constant, its derivative 

1 / 22/1 y[y2 

yi 


(21-11) 
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is not identically zero. We note that the numerator in (21-11) is precisely 
^(l/i,l/2) ** y%y\ ““ ViVii* We have shown that if the solutions 2/1 and 2/2 
are linearly independent, then Wiyi^y^) 9 ^ 0 for some value of x. Con¬ 
versely, if W(2 /i, 2/2) « 0 for some x — xq, so that 

Viipto) V2{xa) ^ ^ 
yUxo) y'iixo) 


we can show that the solutions 2/1 (x) and 2/2W are linearly dependenty for 
we can choose nonzero constants Cj and r2 in (21-6) so that at the given 
point X = Xo our solution autislies the initial conditions 

2/(xo) « 0, 2/'(xo) 0. (21-12) 

But if y{x) in (21-6) satisfies these conditions, then y{x) s 0 because there 
exists only one solution of Eq. (21-1) satisfying initial conditions (21-12) 
and a solution y{x) = 0 obviously satisfies these conditions. We have 
thus shown that the nonzero constants Ci and C2 can be found such that 
C\yi(^) + <^2?/2(^) ^ 0 for all values of x, and hence the solutions 2/1 (x) 
and 7/2(^) linearly dependent.^ 

It follows from this that the problem of finding the general solution of 
(21-1) reduces to the search for some pair of linearly independent solutions 
1/2W- ft should be remarked that no formula is available for the 
determination of solutions of the general second-order linear equation 
In the spe(;ial instance when the coefficients pi and P 2 in (21-1) are con¬ 
stants, the general solution, as we shall see in the following section, is 
deduced easily. 

Example: Verify that 

y ^ Cl sin x •+• C 2 cos x 

is the general solution of 

+ y « 0 

and determine the particular solution such that 

?/(()) - 1, yXO) « (21-15) 

The fact that yi *= sin x and j /2 «» cos x are, in<J(^ed, solutions of (21-14) is easily verified 
by substituting y » siiix and y *« cosx m (21-14). lienee their linear combination 
(21-13) is a general solution provided that the determinant (21-9) does not vanish. In 
our case, 

If(yi,l/2) * 

and thus (21-13) is the general solution. To determine the constants c* such that the 
solution satisfies conditions (21-15), we form the set of Eqs. (21-8), 

Cl sin 0 -h C 2 cos 0 ** 1, 

Cl cos 0 - C 2 sin 0 «• 


sm X cos X 
cos X — sin X 


(21-13) 

(21-14) 


* See in this connection Prob. 0, See. 21. 
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from wbioh it follows that cj •» c® ** 1. Thus the desired solution is y »*• sin a; + 
008 as. 

PROBLEMS 


t* Verify that y •* and y «■ 6““* are linearly independent solutions of y'* -- y ^ 0, 
Also show that yi ■# sinh x and yj * cosh x are a pair of linearly independent solutions 
of this equation. 

2. Show that y «« cie* + is the general solution of y" — 3y' + 2y » 0, and find 
the solution satisfying the conditions y(0) •* 0, y'(0) » 1. What is the solution satisfying 

m y'(0) » 0? 

9. Show that yi « l/x and y® ** are linearly independent solutions of x^y'^ — 
3fl;y' — 5y w 0 if a; 5^ 0. 

4. Verify that y « + cjx is the general solution of x^y'^ — 2xy' + 2y ** 0 if 

X 5 >^ 0, and find the solution such that y(l) « 2 and y'(l) »* 0. 

5. Show that y « cie}^ + is the general solution of y" — 4|/' -f* 4y *» 0, and 
find the solution for which y(0) ** 1, y'(0) *» 4. Also find the solution such that y(0) « 
y'(0) « 0. 

6. Compute the derivative of W(yi,y 2 ) =» yiy® — y^Vu where yi(x) and y^ix) arc two 
solutions of (21-1). Show that yiy® — y^Vi 4- Pi{x)(yiy 2 — y 2 yi) ** 0 and dWIdx + 

Pi{x)W «« 0. Thus TV(yi,y 2 ) *• where Wo is the value of lV(yi,y 2 ) at 

X *» xq. Conclude from this that if W(yi,y 2 ) does not vanish at x xo, it does not 
vanish for any value of x. This result is known as Abel's theorem. 

22. Homogeneous Second-order Linear Equations with Constant Coef¬ 
ficients. Consider the equation 

y'" + viv' + V2y ^0 (22-1) 

with constant coefficients pi, p2- If we substitute 

y = (22-2) 

in (22-1) and note that y' = me"**, y" = m^e”**, we obtain the equation 
(m^ + pim + P 2 )e^'' = 0, 


or 


+ Pim + P2 = 0, 


(22-3) 


since e*"* 9 ^ 0. Thus, if m in (22-2) is chasen as a root of the diaracteristic 
equation (22-3), then (22-2) will be a solution of the given equation. The 
roots of the quadratic equation (22-3) arc 


-Pi dr Vp? - 4p2 

m -- 

2 

If Pi — 4p2 > 0, there will be two distinct real roots, m 
« m3. In this event, 

y = y =s e"*** 


(22-4) 


mi and m 


are a pair of linearly independent solutions of Eq. (22-1), since 
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is not a (Constant when mi ^ m2. Hence if mi 9 ^ m2, the general solution 
of (224) is 

y == + C 2 e^^^. (22-5) 

If Pi — 4p2 < 0, the roots (22-4) are conjugate complex numbers^ 
mi =« a + hij m2 *= a — W, 


and the complex functions 

2/1 = ^ ^(a-ln)x ( 22 - 6 ) 

are linearly independent solutions of (22-1), We can write (22-6) in a 
trigonometric form with the aid of Euler’s formula [cf. Eep (17-3), Chap. 2] 

^(a±hi)z _ bx ib i sin bx), 


so that yi = e®'^(cos bx -f- i sin bx)^ 

7 j 2 = 6"'’^(cos bx — i sin 6x), 


(22-7) 


are the complex solutions of (22-1), 

We show next that wh(m Eq, (22-1) with real coefficients pi and pz has 
a complex solution of the form y = 11 + fr, then the real functions u and 
r are solutions of this equation. Indeed, the substitution of y — u + tv in 
(22-1) yields on rearrangement 


(u" + piu' + p2u) + i(r" + piv' + p2v) = 0, 


and this can vanish if, and only if, 

w" + piw' 4 - P2^ ~ 0, 


v" + piv' + P 2 V == 0 . 

Thus y u and y v satisfy (22-1). 

Referring to (22-7) we see that corresponding to a pair of complex roots 
m = n ± hi of tlv' characteristic equation, we have a pair of linearly in¬ 
dependent real solutions 

7/1 = c®"*" cos bXy 7/2 = sin bx. (22-8) 

It remains to consider the case when pi — 4p2 = 0. In this event the 
characteristic equation (22-3) has a double root 

-Pi 

nil = m 2 = --—» 


and the foregoing method yields just one distinct solution yi *= with 
m zsz --pi/2. We can verify by direct substitution that another solution 
is 1/2 * which is obviously linearly independent, since y 2 /yi =« 
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X 5^ const* Thus, when the characteristic equation has a double root 
fn « —pi/2, the general solution of (22-1) is 


y «: (ci + C2x)e’”^ (22-9) 

In the following section we deduce this solution with the aid of the useful 
^ notion of differential operators, w'hich will be of help in resolving the cor¬ 
responding situation involving multiple roots of linear equations with 
constant coefficients of order higher than 2. 

We illustrate the foregoing discussion by examples. 


Example 1. Find two linearly independent solutions of y"' + 3j/' -f 2]/ » 0, and thus 
obtain the general solution. Referring to (22-3), we see that the eharact/eristic equation 
in tliis case is 

m* 4- 3m -h 2 = 0, (22-10) 


which, on factoring, yields 

(m + l)(m -f 2) « 0. 


Thus, the rewts of (22-10) are mi « —I, = ~2, and hence the general solution is 

y ** 

Example 2. Solve y" + 2|/' 4 «*>v =» 0. The characteristic equation is 

m2 4 2m 4 5 « 0, 

md hence 


m 


-2 db V 4 - 20 

“~2 


~1 =fc 2 f. 


Accordingly, the complex solutions are 

„ ^(- l- 20 r ( 22 - 11 ) 

and by (22-8) the linearly independent real solutions are 

cos 2x, y 2 *= e~^ sin 2x. (22-12) 


It should be remarked (hat for many purposes the complex form of solutions (22-11) 
is just as useful as the real form (22-12). 

Example 3. Solve y" 4 2^/' 4 1 / ** 0. I'he characteristic equation 

4 2m 4 1 0 


has a double root m « — 1. Accordingly, a pair of linearly independent solutions of the 
given equation is j/i «■ ** xe~^. By (22-9) the general solution is 

^ *= (ci 4 C 2 x)e'^^. 


PROBLEMS 


Find the general solutions of; 

1. y" 4 *dy' - 54j/ * 0; 

3. y" - 2|^' 4 y 0; 

ft. 4 4i/ - 0; 

7. y" - 4y' 4 5a/ « 0. 


2 . y" ~ 6y' 4 « 0 ; 

4. j/" — 4|/ « 0; 

6. - 4a/' 4 4a/ « 0; 
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234 Differential OperatorS4 We introduce a new notation for the deriv¬ 
ative symbol and write D ss d/dx and, more generally, m d^/dx^. 
Thus, D sin z means (d sin x)/dx ~ cos x, and sin x ^ (d^ sin x)/dx^ 
« d/dx (d sin z)/dx « —sin x. Since 


dcu{x) du 

-- »= c — 

dx dx 

and 

d(u + v) 

dx 

du dv 

dx dx 


we see that 





Dcu{x) « cDu 

and 

D{u + v) 

= Du + Dv, 

(23-1) 

Moreover, since 

dy 





dx^ dx 

dx"+‘’ 



we can write 

D^(Dy) = 

^ D”+hj. 


(23-2) 


We agree also that 


(D + m)y ^ Dy + my. 


If the s3Tiibol (D + mi){D + mg)?/, where mi and m2 are constants, is 
interpreted to mean that (D + 7«i) operates on (D + m 2 )y ^ (dy/dx) 
+ m2i^, we find that 

(Z) + m{)(D + m 2 )y = [Z>^ + (mi + m 2 )D + mim 2 ]y. (23-3) 
From the structure of the right-hand member of (23-3) it follows that 

(D + m\)(D + 'f^ 2 )y == (Z-^ + m^{D + mi)y. (23-4) 

Making use of these properties, we can write Eq. (22-1), namely, 
d^y dy 

“2 ^ ~ (23-5) 

as {D^ + PiZ) + V 2 )y *= 0, (23-6) 

in which the differential operator 

+ piD + p2 

behaves as though it were an algebraic polynomial. 

We observe that this polynomial is identical with the polynomial in the 
characteristic equation (22-3). On noting (23-3), we see that (23-6) can 
be written in factored form as 


(Z) - mi){D - m 2 )y - 0, (23-7) 

w^here mi and m2 are the roots of (22-3), Now, if mi 9 ^ m 2 , the general 
solution of (23-7), as shown in the preceding section, is 

y mm CiC'^i* + C2e’^*. 
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To obtain the general solution of (23-7) when mi 
as follows: We set in (23-7) 

(D - m)y = V 

and obtain a first-order equation for 

(D — m)v = 0, 
dv 

or -mi; = 0. 

dx 


(CHAF. 1 
m2 s m, we proceed 

(23-8) 

(23-9) 


Its general solution is v = The substitution of this in the right- 

hand member of (23-8) yields the first-order linear equation for y 

dy 

- -my == 

dx 

whose general solution is easily found from (10-8). We thus get the solution 

y (23-10) 

which agrees with (22-9). 

Example 1. Find the general solution of y" -1- by' -i- 6|/ * 0. This equation can be 
written as 

(D* + 5D -f 6)2/ « 0. 

On factoring the operator we get 

(D + 2){D + 3)2/ - 0, 

and thus the general solution is 

y =* 4- 

Example 2. Solve y" — 4y' 4* 42/ « 0. We write this equation as 

(Z>2 - 4Z> 4* 4)2/ « 0 
or (D ~ 2)(D - 2)2/ - 0. 

Since the roots of the characteristic equation are equal, the general solution is 

y ** cie^ 4- C 2 xe®®. 


PROBLEMS 

Solve: 

1. (I)® - - 0; 

8. (2>* 4- D - 2)2/ - 0; 

5. (JO* 4“ 2D + l)y - 0. 


2. (D* - l)y - 0; 
4 . (D - 3)V - 0; 
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24. Konhomogeneous Second-order Linear Equations. The equation 

y" + + V 2 {x)y « /(x), (24-1) 

in which the right-hand member /(x) is a known continuous function, is 
called nonhomogeneoics. The existence and uniqueness theorem of Sec. 1 
guarantees that this equation has one, and only one, solution satisfying 
the conditions 

l/(xo) = Voy 2/'(xo) = yi 

whenever the coefficients Pi(x) are continuous functions. If y =® u(x) 
is any solution of (24-1) and |/i(x) and y 2 ix) are linearly independent 
solutions of the associated homogeneous equation 

y" + Pi(x)y' + p2(x)2/ === 0, (24-2) 

the general solution of (24-1) is 

y ^ Ciyiix) + C27/2(x) + w(x). (24-3) 

The fact that (24-3) is, indeed, a solution of (24-1) follows upon substitut¬ 
ing (24-3) in (24-1) and noting that 

u"(x) + Pi(x)u'(x) + p 2 (x)u{x) = f(x) 

and that y = Ciyi(x) -h C 2 ?/ 2 (x) satisfies the homogeneous equati'^.n (24-2). 
The proof that (24-3) is the general solution is virtually identical with the 
proof in Sec. 21 for the homogeneous equation.^ 

We shall see in Sec. 28 that a particular integral u(x) of (24-1) can al¬ 
ways be determined whenever the general solution ^ of the associated homo- 
geneous equation (24-2) is known. In special instances, however, particular 
integrals of nonhomogeneous equations can be deduced without the 
knowledge of the general solution of the homogeneous equation. This 
vsirnpler technique, based on judicious guesses of the probable forms of 
particular integrals, is known as the method of undetermined coefficients. 
It is applicable to linear equations with constant coefficients only when the 
right-hand member f(x) has certain special simple forms. 

We illustrate the essence of this method by several examples and develop 
it in greater detail in the following section. 

Example 1. The right-hand member of 

y" -I- 3y' 4- (24-4) 

suggests that it probably has a solution of the form y » oe*, for the differentiation of 

* The only difference is that the terms yo — tt(xo) and yj — ui(io) instead of yo and yj 
now appear in the right-hand members of Eqs. (21-8) 

* This general solution is often called the compUmentary function. 
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exponentials yields exponentials. Accordingly, we take y •« oe* as our trial solution, 
substitute it in (24-4), and obtain 



o«* 4* Sac* 4- 2ae* « 2c*. 


On dividing by c* we get 

6a » 2 or a 


Thus, y *» H®* is ^ solution of (24-4). The characteristic equation of the associated 
homogeneous equation 

y'' + 3y' 4- 2y « 0 (24-6) 

is 

37n 4- 2 » 0 


or 

(m 4- l)(wi -f 2) » 0. 



Hence a pair of linearly independent solutions of (24-5) is y « y * and the 
general solution of the given equation (24-4) is y =» Cje“* + C 2 e~^ -f- J^c*. 

K the solution of (24-4) satisfying the given initial conditions is required, the con¬ 
stants a must be determined from these conditions. For example, if we seek the solu¬ 
tion such that y(0) « — nnd y'iO) « 0, we obtain, on setting a; « 0 in the general 
solution, 

— Cl -f cj -f 
or Cl 4- C2 « —1. 

Also, y' «« — cic”’* — 2 c2C~‘** 4* 

and since y'(0) -> 0, we have 

0 » —Cl — 2 c 2 -f H, 

nr Cl -h 2 c2 « 

We eafflly verify that ci *■ — cj « 4-^, and the desired solution therefore is 

y - -’’Ae-” + Ae-^ + He*. 

Example 2. If we attempt to obtain a solution of 

y" 4- 3y' 4- 2y « (24-6) 

by taking a trial solution y «■ we get 

— Sac*"* 4" 2ac”‘* «■ 2e““*. 

This gives a nonsensical result, 0 * 26*”* The reason that the trial solution of the form 
y « oc""* is not suitable in this case is the following: The homogeneous equation asso¬ 
ciated with (24-6), as we saw in the preceding example, has a solution y « ac“"®, and the 
substitution of it in (24-6) naturally makes its left-hand member vanish. In this case 
we take the trial solution in the fomi y *» Then, y' *» — 0 X 6 “**, y'' « 

—aa*“* — 4* oxC"*, and the substitution in (24-6) now yields 

4* axe~* 4- 3ac”^ - 3axe“* -f- 2are“** - 2s"^, 

or *» 2«~*. 

Thus a 2, and a solution of (24-6) is y — 2xe~*. The general solution of (24-6), there¬ 
fore, is 

y »« -H cse"'** 4* 
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Example 3. Find the general solution of 

y'' + 2y' -f y « e-*. (24-7) 

We recall (Example 3, Sec. 22) that a pair of linearly independent solutions of the asso¬ 
ciated homogeneous equation is j/ <» y *» xe""*. Accordingly, neither y «■ o«~* nor 
y « axe'~^ is suitable as a trial solution of (24-7). In this case we take the trial solution 

y « ax*c~*. (24-a) 

We compute 

y' « 2ax€^* — 

y'* m 2 ac~* -- 2(2X6”* — 2 axe”* 


and on making substitutions in (24-7) find 

2ae”® — 4nx6““® -f ax^c”® -f- 4axe”® — 2ax^c“~* -j- ax^c”* •« 


or 206 ® « c”®. 

Thus, a ^ yi, and from (24-8) y -» }^x*«”“® is a solution of (24-7). Its general solution 
is 

y »= cic”® -f C 5 X 6 ”® -f 

These examples suggest a procedure to be followed in obtaining particular 
integrals of equations with constant coefficients of the type 

y" + ViV' + ViD = (24-9) 

The characteristic equation associated with (24-9) is 

-f p\m -f P2 = 0. (24-10) 

If this equation has two distinct roots m = mi and m = m 2 , then the 
linearly independent solutions of the homogeneous equation 

2/" + ViV' + P 22 / == 0 (24-11) 

are y == e*”!* and y == When Eq. (24-10) has a double root m 2 == mi, 

the linearly independent solutions of (24-11) are y = and y — 

Now, ii kin the right-hand member of (24-9) is not equal to either mi or 
m 2 , Eq. (24-9) has a solution of the form y = oe**. If A: is a simple root 
of (24-10), then (24-9) has a solution of the form y — axe*"*". When A; is a 
double root of (24-10), the particular integral can be taken in the form 
y « ox^c*®. 

Similar considerations apply to equations of the form 

y"' + Pi2/' + = -^0 + *4ix H-h -dnx". (24-12) 

W ^ 0, we can take the trial solution 

2/ « ao + ttix + • * • + (24-13) 


' This means that m » 0 is not a root of the clmracteristic equation (24-10) and hence 
(24-11) has no solution y » const. 
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The substitution of (24«13) in (24-12) then yields on comparison of like 
powers of x on both sides of the resulting equation the values of the un¬ 
known constants a». If p 2 =* 0, the characteristic equation (24-10) has 
m 0 as one of its roots. In this event the trial solution can be taken in 
the form 

y ^ x(ao + aiX-\ -h anX^). (24-14) 

We illustrate the use of these rules by two examples. 

Example 4. Find a solution of 

V" + + 2y « 1 4 - 2x. (24-15) 

Binoe pa ■* 2 3 ^ 0 , we take the trial solution 

y^oo^a^x, (24-10'; 

substitute it in (24-15), and find 

3ai 4- 2(ao 4- aix) ** 1 4- 2x. 

On comparing like powers of x, we get 

3fli 4" 2 ao *“ 1 , 2^1 ** 2 , 

whence 

ai ** 1 , oo *» — 1 . 

The substitution of these values in (24-16) gives the desired solution y « —I 4- ar. 
Example 5. Find the solution of 

y" + 3/ = 1 - 9x* (24-17) 

satisfying the conditions y(0) « 0 , y'(0) « 1 . 

Since p 2 ** 0 in (24-17), we seek a solution in the form 

y « a;(ao 4- aix 4- aai*). (24-18) 

We compute 

1 ^' ** oo 4- 2aix 4- Soax®, 
y" ** 2ai 4* Ooaaj 

and insert in (24-17). The result is 

2 ai 4- 6aax 4* 3(ao 4" 2aix 4* Sojgx^ « 1 ~ 9x* 

or 2<ii 4- 3ao 4 - (602 4- 6 oi)x + Hoax® « 1 — 9x*. 

Hence 

2a\ 4 “ 3ao *= 1 , 

602 4" 6 ai ■* 0 , 

902 *• —9. 

Solving these equations, we get 

02 «* -1, oi * 1, oo » — 

and the substitution of these values in (24-18) gives 

y « x{-% 4-x - X*). 
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Tine characteristic equation for (24-17) is 

m® -f 0. 

Since its roots are m « 0, m *• —3, the genera] solution of (24-17) is 

s/ - + Oje"** + x(-}4 + X - 3^. (24-19) 

To determine the constants so tliat the solution (24-19) satisfies the given conditions, 
we compute 

V'(x) - -3c*e~** -(]4-2x+ 3x*). 

The conditions y(0) — 0 and y'(0) « 1 then demand that 

fl -f (*3 « 0, 

-Bc2 - K * 1. 

Thus, C 2 » — Cl » and hence the desired solution is 

y mn ^ ^ -i- z(-}4 + T - X®). 

We state in conclusion that the trial solution for the more general 
equation 

y" + Piy' + PzV = c**(-4o + AiX -h Anx”) (24-20) 

can be sought in the form 

y == e^^ioo + flix H-h anT'^) (24-21) 

if k is not a root of (24-10), If kiss, simple root of (24-10), the trial solution 
(24-21) must be multiplied by x and, if the root is double, by x^. 

PROBLEMS 


Obtain the genera! solution: 


1. 

V" - 

W + = 


2. y" 

+ 2i/' -1 7/ « x; 


8 . 

!/" + 

57/' + 0^/ * 

e‘; 

4. y” 

— 27/' -f 7/ «= x; 


6. 

(O’- 

~ 1)7/ « 5x - 

- 2; 

6 . (/>^ 

- 1)7/ « e2-(x - 

" 1); 

7 , 

(O - 

1)27/ = xc*; 


8 . (// 

~ 6D -f 9)7/ « 


9. 

D(l) 

+ 9 ) 7 / - 3; 


10. v" 

-f 92/ = x2 ~ 2x 

+1; 

11. 

y" - 

y ““ 


12. y" 

-f ?/ “ 3:® 4- x; 


13. 

y" - 

W + 67/ « 

x»e**; 

14. (D 

— ify « e^(x - 

• 1); 

15. 

(D® - 

- hD + 6)1/ ^ 

= 3x=* -b 4x - 2; 

16. (0* 

-- 5D)y * 3x^ • 

f 4x 


Obtain the solution for each of the following equations satisfying the given conditions: 

17 . y” + by' -f- 41/ » 20e^, i/(0) - 0, t/'(0)-2; 

18 . y" -f j/' = 1 4- 2 j, 3/(0) - 0, b'(0) =* 0; 

18 . y" -b 3/' - 0, 3/(0) - 0, 3/'(0) - 0; 

20. y" + 43/' + 3j/ - X, v(0) - —46, !/'(0) “ 

21 . y" -b 43/’ -b 33/ - 0, v(0) - 0, v'(0) - 0; 

22. v" + 4v' -b 3v - X. iKO) - 1, I/'(0) - 0. 
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2§. The Use of Complex Forms of Soiutious in Evaltiating Particular 
Integrals* The method of determining particular integrals of Eq. (24-9), 
described in the preceding section, can be extended to equations of the 
form 

y" + piV^ + p^y ^ cos nx^ (25-1) 

y" + piv' + p 2 y = sin nx, (25-2) 

in which k may be equal to zero. If we recall the formula 

= cos nx + i sin nx, 

it becomes clear that cos nx and sin nx are, respectively, the real 
and imaginary parts of the function Now, if instead of Eqs. 

(25-1) and (25-2) we consider the equation 

y'" + Piy' + p2y « (25-3) 

and obtain its solution y ^ u + iv, the real part u of such a solution will 
satisfy Eq. (25-1) and the imaginary part v will be a solution of (25-2). 
We illustrate this method of deducing solutions of equations in the forma 
(25-1) and (25-2) by examples, 

Example 1. Find a solution of 

3 /" + y “ 3 sin 2x, (25-4) 

Since «« cos 2x + i sin 2a:, we consider, instead of (25-4), the equation 

y" + y ** 3(cos 2x -f- i sin 2x) «• (25-6) 

The imaginary part of a solution of (25-5) is clearly a solution of (26-4). Equation (25-5) 
has the form (24-9) with k « 2i, and since neither of the root/S of the characteristic equa¬ 
tion -h 1 ■* 0 is equal to 2i, we take the trial solution 

y « 

Now y' « 2ia€^^, 

y” » ( 22 )^ae^** « — 
and the substitution in (25-5) yields 

~4ae^^ -f 

Thus, a « — 1, and consequently y « is an integral of (25-6). The imaginary 

part of — is — sin 2x, and hence a solution of (25-4) is y «* — sin 2x. 

Example 2. Find one integral of 

y" -f y - 3 cos x, (26-6) 

Since c*® » cos x -f f sin a:, we consider 

y" + y « 3(cos x -f f sin a:) m 3s**, (26-7) 

the real part of the solution of which satisfies (26-6). This time k in (24-9) is -hi, and 
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einoe the roots of the characteristic equation are dri, we take the trial solution 

y «* oxe^. 


From (25-B), 

y' w ae^ -f aixe^, 
«• 2atc^ — axe**, 


and the substitution in (25-7) gives 

2aie“ — axe^ -f axe^ «* 
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(25-8) 


Thus, o »* 3/2i ** — and therefore 

y » — »« — ?^ix(cos i i sin x) 

is a solution of (25-7). The real part of this solution is sin x, and we conclude that 
y ** ^ sin X is a solution of (25-6). 

Example 3. Find a solution of 


y'' + 2y' 4- 2y e”* cos x. (25-9) 

Since e“* cos x is the real part of 

c~®(co8 X -{• i sin x) ** « gx(~i+»)^ 

we consider the equation 

+ 2y' -j- 2y « (25-10) 

The roots of the characteristic equation 

4- 2w 4* 2 »« 0 

are m =» — 1 db t, and since one of these roots appears in the exponent in (25-10), w© 
take the trial solution 




Then, 


y' =« 4- ax( — l 4- 

y" « 2o(-l 4 4- ax(-l 4- 

and on making substitutions in (25-10), we find 

1 


so that 

Thus an integral of (25-10) is 
1 


2i 


y a» — xe~ *(cos x 4* f sin x). 

2i 2% 

The real part of this, }ixe~ * sin x, is a solution of (25-9). 

The methods of this and tlie preceding section can be extended to 
equations 

y" + piy' + V 2 y = /W (25-11) 

in which the right-hand member is a sum of several functions of the types 
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considered in these sections, for suppose that f{x) «= Si{x) + so that 
(25-11) reads 

y" + viv' + P 2 y ‘‘ fi(^) + / 2 (*)- (25-12) 

If we consider a pair of equations 

2/" + Piy' +P2y =/i(x), 

(25-13) 

y" + Piy' + P2y = M^), 

and denote the solution of the first of these by y = Wi(x) and that of the 
second by y = U 2 (x), then y = Ui(x) + U 2 (x) will be a solution of (25-12). 
The proof follows at once on inserting y ~ Uy{x) -f- ^ 2 ( 0 :) in (25-12), As 
an illustration of the use of this theorem we consider a simple example. 

Example 4. Find one solution of 

y" y ^ 3 cos X + 1 + 2e®. 

We consider three equations: 

y" 4 - y M 3 cos X, 

J/" + i/ « 1, 

1/" + y * 2e*. 

A particular integral of the first of these, as shown in Example 2, is y » J^x sin x, and 
solutions of the second and third equations are, respectively, y » 1 and y « e^, as is 
clear by inspection. Hence an integral of the given equation is y -» sin x -)- 1 + 


PROBLEMS 

Solve: 

1. (Z>2 - 3i> -f- 2)y - cos 2x; 

2. (£)^ 4* 4)y *» cos 3x; 

3. (// - -* = ~ cos X — 3 .sin x; 

4 . y" 4 5y' -f Gy ** 3^-^^ -f 

5. y*' + 2y' 4" 5y e® sin 2x; 

y^^ — y^ Gy = cos 3x; 

7. {D^ - 25)y - -{-x^ - 4x; 

8. 4* l)y ^ 3 sin 2x - 9 cos 3x. 

Obtain the solution satisfying the conditions y(0) * 0, y'(0) « 0 for each of the fol¬ 
lowing: 

9. y" - y ^ sin x; 10. y'" 4- 2y' 4- 5y * 0; 

11. y'' — 2y' » 6“'' cos x; 12. y" -f t/ » cos x -f 1. 

26. Linear nth-order Equations with Constant Coefficients. The results 
of Secs. 22 to 25 are easily extended to nth-order linear equations 

2/^"’ + -1 - hPny = fix) ( 26 - 1 ) 

with constant coefficients. In dealing with such equations it is convenient 
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to make a systematic use of the operator notation introduced in Sec. 23 
and write (26-1) in the form 

(Z)” + H-f- Pn^lD + Pn)y ~ fix), (26-2) 

The homogeneous equation associated with (26-2) is 

(Z)’* + ‘ + Pn^iD + pn)y - 0, (26-3) 

and if one substitutes in it t/ = there' results 

(m” + H-h Pn-lW + = 0. 

It follows that y = is a solution (<f (26-3) whenever m is a root of the 
characteristic equation 

w” + H-h pn-jm + Pn = 0. (26-4) 

If this equation has n distinct roots 

m — ini, rn — m 2 , ..m = 

then y - y = , y - 

are distinct solutions, and we can conclude (sec Sec. 27) that 

y = H - h (26-5) 

is a general solution in the sense that the arbitrary constants in (26-5) 
can be determined to satisfy the prescribed initial conditions 

Vixo) = 2 / 0 , y'ixo) - y'o, (26-6) 

Since the coefficients in (26-4) are real, the complex roots of (26-4) must 
necessarily occur in conjugate pairs. Thus, if nii = a T hi and m 2 ~ 
a — hi are a pair of such roots, the solutions corresponding to them are 

_ ^“"^(008 hz + t sin 6.r), 

=- e^*(cos bx — i sin hz). 

As in Sec. 22, we prove that the real and imaginary parts of these solutions 
yield a pair of linearly independent real solutions 

y == cos bxj y « c®"' sin bx. 

When the roots of characteristic equation (26-4) are not simple, and if, 
for example, the root mi has the multiplicity fc, then corresponding to it 
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there will be a set of fc distinct solutions/ 

yi = «”>*, to = xc”***, ..to » (26-7) 

The proof of this assertion follows upon making obvious modifications in 
the argument presented in Sec. 23. 

We illustrate these statements by two examples. 

MmmpU 1. Find the general solution of the fourth-order equation 

yrv _ -h 22 /'^ -- 2 j/' + 1 / » 0 , (26-8) 

or (i)^ - 2D» 4- 2D2 - 2D + \)y * 0. (26-9) 

The characteristic equation for (26-8) has the structure determined by the operator in 
(26-9). It is 

~ 2m^ 4 “ 2 m- - 2 m 4 - 1 * 0 . 

On factoring this we get 

(m2 4- l)(m - 1)2 « 0. 

Thus, there are two simple roots mi « t, m 2 =» — t and the double root mg m 4 *■ 1 . 
Solutions corresponding to these roots are 

yi « e**, yt * yz « e^, iji » 

and the general solution is 

y « ric“ 4* ^ 26 ”’** 4- cgc* 4“ r^ixe^. 

This can be written in real form as 

j/ « Cl cos J 4“ C 2 sin a* -h (Cg 4- 
Example 2 . The equation 

(D^ 4- 3/>» 4- 3D2 4 - D)y “ 0 
or D(D 4- 1)V - 0 

has the characteristic equation 

m(m 4" 1)® - 0. 

Accordingly, the general solution is 

y •» 4* 4* 4- 

An argument in every respect similar to that given in Sec. 24 yields the 
result that when y = u{x) is any solution of Eq. (26-1) and y *= Ciyi(x) 

+ C 2 l/ 2 (^) H-h Cnyn{x) is thc general solution of the homogeneous 

equation (26-3), then the general solution of (2()-l) is 

y = Ciyi{x) + C 2 to(a:) d- h c„yn(x) -h u(x). (26-10) 

* If the complex root mi « a 4- W is of multiplicity k, then corresponding to this root 
and to its conjugate m 2 «• a — bi, there will be a set of 2k real solutions: 

e®* cos bXf cos 6x, .. cos bx, 

^ sin fox, x«®* sin fox, ..., sin hx. 



SEC. 26] UNEAR DIFFERENTIAL EQUATIONS 60 

The calculation of particular solutions u(t) by the method of undetermined 
coefficients for functions f(x) of the type considered in Secs. 24 and 25 
follows, with obvious minor modifications, the pattern of those sections. 
Without further ado we illustrate the procedure by examples.^ 

Example 3. Find a solution of 

j/"' 4- y" 4 4 3a: 4 1. (2^-11) 

The left-hand member of this equation contains no y (that is, pz * 0). On recalling the 
statement made for Eq. (24-12), we take the trial solution 

y « x{ao + a\x 4 Oax^). 

On computing the first thr<ie derivativ<*^ we obtain 

y' ao 4 2aiL + 3a2*r^ 
y" =* 2ai -f fk/ox, 

2/'" « fkl2 

Substitution in (20-11) tbrn yields 

(2a{) + 2ai -f (>a2) 4 (0a2 4 4ai)x 4 0a2X^ =* x" 4 3x 4 F 


Hence 0a2 1, 

0(22 4 4(ii 3, 

2 (2 o 2 (2 i -f- 0(22 1 

and we conclude that 

02 =» />6. ^2i Qo ~ 

Accordingly, y « x( —4 4 a solution of (20-11). 

Example 4. Obtain tlu' general soluticm of 

~ 37/ 4 2D)y - 4 OOe^'. (26-12) 

The characteristic ecjuation for (20-12) us 

7n^ — 3m^ 4 2m «= m{m — l)(ni — 2) « 0. 


Thus, the general solution is 

y ^ Cl C2e/ 4 4 w(x), 

wdiere w(x) is some integral of (20-12). To obtain i/(x) it is simpler to add the particular 
integrals of 

(// - 3/)" 4 2D)y = 4, (26-13) 

(D« - 3i>2 4 2D)y « &)e^. (2(4-14) 

For the first of these we take a trial solution y « ax. We find on inserting it in (26-33) 
that a »= 2, so that y =» 2x is an integral of (20-13). The substitution of in 

(26-34) yielfls, after simple cal<*ulation, a *= 1; hence y « is a solution of (20-14). 
Accordingly, an integral of (20-12) is ?/ « 2x 4 and the desired general solution is 

ea Cl 4 4 cze^^ 4 2x 4 e®*. 


' A general method is presented in Sec. 28. 



70 OHDINARV DIFFERENTIAL EQUATIONS (CHAP. 1 

Example 5. Obtain one solution of (/>* + 2J> 4- 7)y « — 24e* cos 2x. The right- 
hand side is the real part of —To solve 

(D* + 2D -f 7)y * (26-15) 

try y « Substitution gives 

(D* 4* 2D + «■ ((1 4 2f)8 4 2(1 4 2?*) 4 

« (_2 4 


where the last equality is stated because we want to obtain a solution of (26-15). 
follows that 


-24 

-2 4 ^ 



12 1 4 I 

1 — 1 1 4 ^ 


6(1 4 0, 


It 


and hence a solution of (26-15) is 


2/ « 6(1 4 ** 6(^(1 4 2 )(cos 2x 4 ? sm 2x) (26-16) 

Since —24fi^cos2x is the real part of a solution of the original problem is 

found by taking the real part of y in (26-16). Thus, 


y » 6e* cos 2x — 6e^ sin 2x. 


PROBLEMS 

Find the general solutions: 

1. (D - 5)(2D 4 3)Dt/ = 0; 


3. (D'^ 4 3D2 4 3D 4 l)y « 0; 

6. (D® - 2D2 4 D)y « 0; 

7. (D^ - k*)y «= 0; 

9. (D^ — D^ 4 4D),v «= 4x 4 e*; 
11, (D - l)(D - 2)hj « t2; 

13. P'ind the solution of y'" 4 2y* 
y'm * 0, y"(0) « -25/2. 


2. (D* 4 1)(D2 4 2D 4 5)i/ * 0; 

4. (D^ 4 8)1/ - 0; 

6. (D* 4 3D» 4 3D^ 4 D)y « 0; 

8. (D^ - 3D‘^ 4 4)y * 0; 

10. (I>^ 4 l)v *= 2oosx; 

12. (D 4 1)(D - 1)(D -2)y - e 

y' — 22/ «* 2c" 4 which satisfies y(0) 


27. General Linear Differential Equations of nth Order. It is not dif¬ 
ficult to extend the considerations of Sec. 21 to a homogeneous nth-order 
linear equation 

4--1- Vn-i{r)y' + p„(a-)3/ = 0 (27-1) 


with variaVile coefficients Vi{x). Word-for-word repetition of the argument 
used to establish properties 1 and 2 of Sec. 21 leads to the conclusion that 

y = ci2/i(x) 4' C2?y2W H-h CkVkix) (27-2) 

is a solution of (27-1), for an arbitrary choice of the constants Ci, whenever 
2/i(x), y 2 i^)} • • 2/ifc(^) is a set of solutions of (27-1). 

A set of k such solutions is said to be linearly independent if the relation 


CiViix) 4- C2.y2{x) 4- hCkVkix) s 0 (27-3) 

holds only when Ci =: C 2 == • • • ~ 0. When a set of constants c*, 
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not all of which are zero, can be found such that Eq. (27-3) is true, the 
solutions yiix) are linearly dependent 

It foIloAvs from the existence and uniqueness theorem of Sec. 1 that 
Eq. (27-1) has exactly n linearly independent solutions y\{x), . * yn{x), 
so that 

y = ciyiix) + C2y2{x) H-h CnVnix) (27-4) 

is a general solution of (27-1), This solution is general in the sense that 
the constants c, in (27-4) can always l>e found, so that there is a unique 
solution of (27-1) for the arbitrarily specified initial values 

v(3-o) = J/o, </'(«o) = ?/o, •••, //'’‘“"(a-o) = (27-5) 


An argument analogous to that used to establish the condition (21-9) for 
linear independence of two solutions leads to the result that the set of n 
solutions {yi(x)]y i ~ 1, n, is linearly independent if, and only if, 
the Wronskian determinant 





2/2 

• • Vn 

/ 

t 

/ 

y\ 

2/2 

► . Vn 

^.in~ 1) 


-,(n 

Vi 

V2 



(27-6) 


doc‘s not vanish for any x in the interval wluTe solutions are sought. 

In contradistinction to the case of linear equations with cor >tant co- 
efheients, no formulas are available for solving general linear equations 
with variable ooeOicients of order 2 or higher. (Vrtam sj)€*cial types of 
such e<]uations, however, have been studied extensively, and as shown 
in Cdiap 2, Sec 12, their solutions may be obtained as power series. 

Just as in Sec. 24, we can show tliat if ^ = n(x) is any solution of the 
nonhomogeneous e(ination 


H - h Vn-ii^)'/ + Pni^)y = f{x), (27-7) 


(hen y = Cii/,(.t) -f r 2 V 2 (x) -I-4- Cnyn(x) + '/(x) (27-8) 


is the general solution of (27-7) whenever the /V,(j) are linearly independent 
solutions of the homogeneous eejuation (27-1) The determination of 
particular integrals of (27-7), as we shall sof» in the next section, is a straight¬ 
forward process provided that the general solution of the associated 
homogeneous equation is known. 


Example 1. Sbow^ that the set of functions t/i «» x, 1/2 »* x^, » x® is linearly Inde¬ 

pendent if X 0. The Wroaskian (27-0) for this set of functions is 


1 2x 3x‘ 
0 2 (u 


2x\ 
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Sm<je does not vanish as long as a; 0 , this set is linearly independent in 

any interval that does not include x « 0 . 

Example 2. Test for linear indep<mdonoe » a;® -f 2 x, ys » x* + x, yz « 2x* — x*. 
We compute the Wronskian for this set of functions: 


W(vi,yi,yz) 


X® + 2x X* -f X 2x® — X® 
2x -f 2 3x® -h 1 6x® - 2x 
2 6x 12x - 2 


0 . 


Since W(yuyi,yz) ■» 0 , the given set is linearly dependent. This implies that a set of 
constants Ci, cj, cs, not all zero, can be found such that 


cm -f C2y2 4- czyz « 0. 

Tliis, in turn, means that at least one of these functions can be expressed linearly in 
terras of the remaining ones. In fact, it is easy to cheek that 1/3 «» 2 i /2 — 3 / 1 . 


PROBLEMS 

Test for linear dependence the following sets of functions: 

1. c^®, I, e*, sinh x; 2. 1 , sin x, cos x; 

3. X® — 2x 4” 5, 3x — 1, sin x; 4. (x 4- 1)®, (x — 1)®, 3x; 

6 . 6 ^®, c®®, a h 9 ^ c 9 ^ a; 6 . e^®, sin x, cos x; 

7, e®, xc®, x®e*. 

28. Variation of Parameters, We proceed to show that a particular 
integral y « u{x) of every nth-order linear equation (27-7) can be cal¬ 
culated by the so-called method of variation of parameters whenever the 
general solution of the related homogeneous equation (27-1) is known. 

To make the procediire clear, we first develop it for the second-order 
equation 

2 /" + + V2{x)y « /(x) (28-0 

and then extend it to the general case of Eq. (27-7). I^t us suppose that 

y == ciVifx) + C 2 y 2 {^) (28-2) 

is the general solution of the homogeneous equation 

y" + + P 2 (^)y = 0, (28-3) 

We shall attempt to find an integral of (28-1) in the form 

y i^i(^)//i(^) + V2{x)y2ix), (28-4) 

obtained from (28-2) by replacing the constants c» by some unknown 
functions v^(x). 

If we substitute (28-4) in (28-0> we shall obtain one equation which 
imposes a condition to be satisfied by two unknown functions vi(x) and 
Since one such condition does not detennine the unknown functions^ 
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we need another equation relating vi and ^ 2 * We shall impose this second 
condition in a way that would tend to simplify the calculation of Vi and V 2 . 
If we differentiate (28-4), we get 

y' ^ (v^y[ + V2y2) + (viVi + 4 ^ 2 ). (28-5) 

Now, the calculation of 2 /" will be materially simplified if vi and are 
chosen so that the expression in the second parentheses in (28-5) vanishes. 
Accordingly, we set 

^lyi + V2y2 = 0 (28-6) 

and take y' - viyi + <^ 22 / 2 * (28-7) 

Then y" « Viyl + ^ 2.^2 + ^hl/i + 42/2- (28-8) 


The substitution from (28-4), (28-7), and (28-8) in the original equation 
(28-1) yields, on rearrangement, 


viiyi + piy'i + P 22 / 1 ) + V2(y2 + pm + ?>2i/2) 




(28-9) 


But since yi and i /2 are known to satisfy (28-3), the expressions in the 
parentheses in (28-9) vanish. We thus get 

ViUi + vm = /(^). (28-10) 


The pair of equations (28-G) and (28-10) can be solved for 4 and 4 to 
yield 


since the determinant 



0 

2/2 


yi 

0 

p ' 

/ 

?/2 

/ 

yi 

/ 

Vi = 


. . . 

V 2 = 

— 



l/l 

y 2 


?/i 

1/2 


yi 

i/2 


yi 

2/2 


(28-11) 


^(yim) 


y\ 112 

lA 2/2 


never vanishes inasmuch as yi and 2/2 are linearly independent solutions of 
Eq. (28-3). 

The right-hand members of Eqs. (28-11) are known functions of x, and 
on integrating them we obtain vi{x) and V 2 ix). We can thus write an in¬ 
tegral of (28-1) in the form 


y 



-fy2 

W{y^,y2) 


dx + y2{x) f 


fVi 

W(2/i,y2) 


dx, 


obtained by inserting Vx{x) and V 2 {x) in (28-4). 


(28-12) 
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Example 1. Find an integral of 


X^V'" ~ 2xy' 4- 2y « a: log 


if a? > 0. 


(28-13) 


It is easily checked that a pair of linearly independent solutions of the homogeneous 
equation associated with (28-13) is yi «* x, y 2 “ Thus its general solution is y » 
Ciz -f c*x*. Accordingly, we seek an integral of (28-13) in the form 


y «* t;jx 4- 


(28-14) 


On dividing (28-13) through by to reduce it to the standard form (28-1), we see that 
f(x) *» (log x)/z. Thus, Eqs, (28-8) and (28-10) yield 


ViX 4" V2Z^ 
ejT 4- %2x 

Solving these for vi and t >2 we obtain 

log a- 

) 

X 


0 , 

logx 


Vi « 




and thus 


r.- -/ 


logs- 


dr, 


logx 
^2 > 


' logx 


V 2 ^ J dx. 


Integrating these and dropping integration constants (for any int-egral will do), we find 


vi - 


V2 


- -(1 4- logx). 
X 


(28-15) 


The substitution from (28-15) in (28-14) yields the desired integral of (28-13) in the form 
y » ~x[l 4- log X 4- }l 2 (logx)®| 

Of course, we could have obtained this result directly from formula (28-12). 

The foregoing procedure can be generalized to compute an integral of 

2 ^^"^ + Pi -I-b Pn~iW?/ 4- p(T)y = fix). (28-10) 

If Viix),, . Vnix) is a known set of linearly independent solutions of the 
corresponding homogeneous equation, we seek an integral of (28-lG) in the 
form 

y ^ Vi(x)yi(x) + V2{x)y2{x) 4-f~ Vn(x)yn(x), (28-17) 

where the v,(a:) are unknown functions. To determine them we form the 
set of n — 1 equations by equating to zero the terms involving the v[ (x) 
in the expressions resulting from differentiating (28-17) successively n — 1 
times. The nth equation is got by inserting the corresponding values of 
derivatives in (28-10). We illustrate the procedure by an example.^ 

* Sec also Prob. 5 at the end of this section. 



SEC. 28] LINEAR PIITERENTIAL EQITATIOlte 75 

Example 2. Find an integral of 

111 

y"’ + ^ V' - ^ V “ ^ log *. X 9^0. (28-18) 

A set of linearly independent solutions of the corresponding homogeneous equation is 
known to be * 

yi ^ X, yn ^ X log x, yz « a:(log x)*. (28*19) 

Accordingly, we take the integral of (28-18) in the form 

V « vix -f t> 2 X log X -f t> 5 x(Iog x)l (28-20) 

For the third-order equation the procedure just sketched yields the system of three equa¬ 
tions: 

v[yi -F f>2y2 + * 0, 

^iVi -f v^y2 + •» 0, (28-21) 

VlVl +^2^2 4-«^3y3 “/(^). 

The reader will verify that, on setting/(x) = (1/x^) logx and noting (28-19), the sys¬ 
tem (28-21) yields 

v{ « ~ (log x)^ ~ ^ H ^ log *2 

and we can take 

t'l - H(logx)^ V 2 - --HGogx)*, t;i - K(logx)* 

Substitution in (28-20) gives finally y » (x/24)(log x)^. 

PROBLEMS 

1. Use the method of variation of parameters to find integrals of the following equa¬ 
tions with constant coefficients. 

(a) y' 4- 3]y ^ x*'*; (c) y'' - 2y' -f y « x; 

(b) y" 4- by' -F i>y ^ e*; (d) y'" - 3y' 4- 2y « 2{8in x - 2 cos x). 

2. Find the vSolution of 

^ 4“/i(x)y «/2(x) 

by the method of variation of parameters, and compare your result with that of Sec- 
10. The solution of the related homogeneous equation is obtained easily by separation 
of the variables 

8. By the method of variation of parameters, find a particular integral of 

3 dy 5 

-- 5 y ** log X, 

dx^ X dx X® 

whore the general solution of the related homogeneous equation is 

6 

y *■ —1~ C2X\ 

X 

* See Example 1, Sec, 30. 
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4 . find tile general solution of 

s dy _ 1 

dz* 1—idx 1— 


— X, 


where the general solution of the related homogeneous equation is cj«* 4- e»x. 

6. Show that the formula corresponding to (28-12) for an integral of (28-16) is 


!/(*) 


' 2 j 


WiiVuVi,-- -.Vn) 

W{yi,y2,.. .,Vn) 


f(x) dx, 


where , .,yn) is the Wronskian and TKg is the determinant obtained from W 

by replacing the fth column by (0,0,0,,. .,1). 


29. Reduction of the Order of Linear Equations. The method of vari¬ 
ation of parameters can be used to reduce the solution of every nth-order 
linear homogeneous equation to the solution of a Hne^ar equation of order 
n — 1 when one solution of the nth-order equation is known. This matter 
is of some importance in deducing general solutions of second-order linear 
equations, because one integral of such equations can often be determined 
by inspection. 

Let yi{x) be a solution of 


y" -f pi(x)y' + p 2 (x)y ^ 0, (29-1) 


so that y ~ cyi(x) is a solution for any constant c. If we replace c by an 
imknown function v(x) and seek a solution of (29-1) in the form 


y « v{x)yi{x), (29-2) 

we get, on differentiating (29-2), 

y' ^ vyi + v't/i, (29-3) 

y" *= m + 2rVi + 


Substituting from (29-2) and (29-3) in (29-1) and noting that yiix) is a 
solution of (29-1), we get a separable equation 




for v{x). 

Separation of variables in 


+ v'(2y\ -h piyi) = 0 

(29-4) gives 


so that 




Vi 


-2 log 2/1 -1 Pi dx. 


(29-4) 


Hence 


2/1 


g-fPt is 


We see that v^{x) 9 ^ 0, so that v ^ const. 


(29-6) 
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29] 

Integrating (29-5) we obtain 

V =*= / dx, (29-6) 

so that the second linearly independent solution of (29-1), by (29-2), is 
1/2 = yx{^)dx, (29-7) 

We dispense with quite analogous calculations showing that the solution 
of an nth-order linear equation can be reduced to that of a linear equation 
of order n — 1 when one integral of the nth-order equation is known. 

Example ‘ The equation 


with X 7 ^ kl has an obvious solution vi x. I'o determine another solution we set 
y a» vx. The function r, determined by formula (29-0), is 


V 


j, -2/0^ J-,) 

log (2x-l) 


dx 


X 





dr 



X 

Thus the second solution is y « cj =* 


PROBLEMS 


1. The equation x'^y" + 2j* 7/' » 0 has an obvious solution 
is another solution, and thus find the general solution. 

2. One solution of 


1 /" + ■ 


~ 2x 


X^ + X 


X^ -f X' 


V 


y 


1 . Show that y 


0 


l/x 


obviously is y «» x®. Show that a st'cond solution in y ** xc'^^. 

3. A special case of I^>gendre's equation 

(1 - x^)y" - 2xy' + 2y ^ 0 

has an obvious solution y ^ x. Obtain a first-order equation for a second linearly 
independent solution of this equation, and solve. 
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SO. The Eoler-Catichy Equation. An equation of the form 

■ 4- an-ixy' + a„y « /(*), (30-1) 

where the a*- are constants, is usually called Cauchy’s equation, although 
it was examined earlier by Euler. We show that by a change of the in¬ 
dependent variable x, it can be transformed into an equation with constant 
coefficients which can be solved by familiar methods. 

If we set 

X == (30-2) 

dx dz 

then — =a e* and — =■ 

dz dx 


On writing D » d/dz, we get 


I/' 



dy dz 
dz dx 


= c " Dy, 


Also 


dy^ 

dx 


dj/ dz 
dz dx 


W)e^ 


= e-2‘(Z)2 - D)y 
= e-^’D(D - l)y. 


In a similar way we find 

yC«) = - 1){D -2) ■■■ {D-n+ l)y. (30-3) 

From (80-2), = e*", and the substitution from (30-3) in (30-1) there¬ 

fore yields the equation with constant cocflieients 


\DiD - 1)(Z) - 2) • - • (D - n -f I) 


-f aiI){D — 1) • • • (D — n -f 2) 4-1- a„_iD 4- «„]!/ = /(c'). (30-4) 

If a solution of (30-4) is denoted by y = F{z), then the solution of the 
original equation, as follows from (30-2), is y = F(log x). 

Example 1, Find the general solution of 

**»"' + xy' - y - xlcygx. ( 30 ^) 

Upon setting a; this equation becomes 

fIXX> - IXD ~ 2) 4- /> - lly « 

or (D* — 3Z)* -f 3D — l)y » se*. (30-6) 

The roots of the characteristic equation obviously are mj ** rat «■ mi «« 1. Hence the 
solution of the homogeneous equation is (ci -j- -f c*a 5 *)e*. 
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Inasmuch as the characteristic equation has a triple root and the right-haud member 
of (30-6) is a solution of the homojseneous 0 (]uation, we take the trial integral in the form 
(se»e Sec. 27) 

y * az^e^. 

The substitution of the trial integral in (30-6) shows that a =»» 3 ^ 4 , so that the general 
solution of (30-6) is 

2/ « (ri 4- C20 4- 4- 

P'inally, the substitution of 2 » log x gives 

2/ « (ri 4 - r2logar 4- raflogx)^ “f M4^(logx)^ 
which is the desired solution of (30-5). 

The general solution of the homogeneous equation 

^Oyin) -I-1. an-iXy' + a„(/ = 0, 

associated with (30-1), can often he found by taking a trial solution ij = 
This is illustrated in the following example. 

Example 2. Solve x^i/" 4- 2x|/ » 0 Tlie substitution of y ~ yields the equation 
m(w — l)x"‘ 4* 27rix"‘ =» 0, 
or rnirn — 1) -f 2m ^ 0. 

Since m « 0 and m ^ —1 .satisfy this equation, y ^ x^* ~ 1 and y « x“^ are linearly 
independent solutions of the given eipiation The general s<j|ution, tlu*r-efore s 

y « n 4- r2X*‘k 


PROBLEMS 

Find the general solutions of: 

1. 4- 4x1/' + 2y log x; 2. x^y'" — 4x“i/" 4* 5xi/' — 2v » 1; 

3. Tv- - 2x?/' T 2// — x logx. 

By assuming a solution of the form ^ = x*” solve: 

6. x'^y'* — ixy' 4- by « 0; 6. x^y" T 2xy' — n(n T 1)!/ « 0. 


APPLICATIONS OF LINEAR EQUATIONS 


31. Free Vibrations of Electrical and Mechanical Systems. 

Sec. 19 that the equation 


d{mv) 

dt 


F{s,v,t), 


We saw in 


(31-1) 


stating Newton's second law of motion, is readily integrable when the 
external force F is a function of the displacement s alone, when it is a 
function of the velocity v alone, or when it depends only on the time L 
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In this section we examine other types of this equation which are of cardinal 
importance in the analysis of oscillating electrical and mechanical systems. 

Throughout this discussion we shall assume that tlie mass m is constant, 
so that Eq. (31-1) can he written as 

d^s 

where v = ds/dt. 

We begin our study with a simple mechanical system which is a proto¬ 
type of more general systems that appear in the analysis of vibrations of 
elastic structures. 

IjCt it be required to determine the position of the end of an elastic 
spring set oscillating in a vacuum. If a mass M is applied to one end of 
the spring whose other end is fixed, it will produ(‘e the <dongation s, which, 
according to Hooke’s law, is proportional to the 
applied force F = Mg, g being the gravitational 
acceleration. Thus, 

F = ks = Mg, 

where k is the stiffness of the spring. 

If at any later time t an additional fore.e is ap- 
plied to produce an extension ij, after which this 
additional force is removed, thr' spring will start 
oscillating. The problem is 1o determine the posi¬ 
tion of the end point of the spring at any siibst*- 
quent time. 

The forces acting on tla^ mass M are the force of 
gravity Mg downward, which will be taken as the 
positive direction for the displacement y, and the 
tension T in the s[)iing, which acts in the direction 
opposite to that of the force of gravity (I’ig 10). Hence, from Newton's 

second law of motion, ^ 

d V 

M-^ - Mg T. 
dr 

Since T is the tension in the spring when its elongation is s + i/, Hooke's 
law states that T ^ k{8 + y), so that 

<Py 

M-~^Mg-^k{B + y). 
dr 

But Mg ^ ks, and therefore the foregoing equation becomes 

d^y 

M — + ky ^ Q. 

dr 
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Setting h/M » reduces this to 

^ + a^ 2 / « 0 or {D^ + a^)y ^ 0. 
dr 
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(31«2) 


Factoring gives (I) 
general solution is 

or, in real form, 


ai)(D -f ai)y = 0, from which it is clear that the 
y « + C 2 C®*^, 

y — A cos at + B sin at 


The arbitrary constants A and B can be determined from the initial 
conditions. The solution reveals the fact that the spring vibrates with a 
simple harmonic motion whose period is 



"df\ 


The period depends on the stiffness of the spring as w^ould be expected— 
the stiffer the spring, the greater the frequency of vibration. 

It is instructive to compare the solution just obtained with that of the 
corresponding electrical problem. It w'ill be seen that a striking analogy 
exists between tlie mechanical and electrical 
systems. This analogy permits one to replace 
a study of complicated mechanical systems by 
the analysis of performance of mathematically 
equivalent simple electrical circuits. 

Let a condenser (Fig. 17) be discharged 
through an inductive coil of negligible resist¬ 
ance. It is knowm that the charge Q on a con¬ 
denser plate is proportional to the potential dilTerence of the plates; that is, 

Q = CF, 


L 

Fro, 17 


where C is the capacity of the condenser. Moreover, the current I flowing 
through the coil is 



and, if the inductance be denoted by L, the emf opposing F is L dl/di^ 
since the IR drop is assumed to be negligible. Thus, 
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Simplifying gives ^ 

which is of precisely the same form as (31-2), where (j? « 1/CL, and the 
general solution is then 

i i 

<3 == A cos — 7 ==== + B sin —r— • 

VCL VCL 

The period of oscillation is 

T = 2jrV^. 


Note that we can make the inductance L correspond to the mass M of the 
mechanical example and 1/C correspond to the stiffness k of the spring 

32. Viscous Damping. Let the spring of the mechanical example of 
Sec. 31 be placed in a resisting medium in which the damping force is 
proportional to the velocity. This kind of dami)ing is termed viscous damp¬ 
ing. 

Since the resisting medium opposes the displacement, the damping force 
r(dy/clt) acts in the direction opposite to that of the displacement of the 
mass ikf. The force equation, in this case, becomes 


M 


or, since Mg =» ks, 


d^y 

TT ^ ^9 - Hy + s) - r 
dr 

d^y r dy k 


dy 

dt 


To solve this equation we write it in the more convenient form 


d^y 


+ 26 y + d^y 
dt 


0 , 


(32-1) 


where 26 = r/M and = k/M, In this case the characteristic equation 
is 

-f 26m + = 0 

and its roots are 

m « -6 =fc \/b^ -- , 

so that the general solution is 

y -= cie^~>>+^i‘*-o' )t ^ )t ^32-2) 

It will be instructive to interpret the physical significance of the solution 
(32-2) corresponding to the three distinct cases that arise when 6'-* — > 0, 
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= 0 , and 5^ — < 0 . If is positive, the roots m are 

real and distinct. Denote them by mi and m 2 , so that (32-2) is 

y = + C 2 e^^K (32-3) 

The arbitrary constants c*i and C 2 are determined from the initial conditions. 
Thus, let the spring be stretched so that y — d and then released without 
giving the mass M an initial vekxdty. The conditions are then 


when i » 0 and 


y ^ d 


d^ 

dt 


- 0 


when < = 0 . 

Substituting these values into (32-3) and the derivative of (32-3) gives 
the two equations 


d — Cl + C 2 and 0 =* wiCi + W 2 C 2 . 
These determine 

W2d mid 

cj 3 =-and C 2 -- 

mi m2 mi — m2 


Hence, the solution (32-3) is 

d 

y - -(7nic”*2^ — 

nil — m2 


The graph of the displacement represented as a function of t is of the type 
shown in Fig. 18. Theoretically, y never becomes zero, although it comes 


arbitrarily close to it. This is the 
so-called overdarnped cas<;. The re¬ 
tarding force is so great in this caK(‘ 
that no vibration can occur. 

If 6 ^ = 0, the two roots of 

the characteristic equation are equal 
and the general solution of (32-1) be¬ 
comes 

y = + C 2 O. 



If the initial conditions are y ^ d, dy/dt = 0 when t = 0, the solution is 

y =3 ^^“-^'(1 + ht). 


This type of motion of the spring is called deadbeat. If the retarding force 
is decreased by an arbitrarily small amount, the motion will become 
oscillatory. 
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The most mteresting case occurs when 6^ < a^, so that the roots of the 
characteristic equation are imaginary. Denote by so that 


w » — 6 =fc: fa 
and y « 

ae cos a^ + sin at). 

If the initial conditions are chosen as before, 


when t « 0 and 


dy 

dt 


when < 0, the arbitrary constants A and B can be evaluated. The result 


IS 


y * de ^cos at — sin » 

which can be put in a more convenient form by the use of the identity 

.4 cos d A~ B sin B s cos ^ — tan""^ • 

The solution then appears as 

2 / « ~ Va^ + cos (at - tarr^ • (32-4) 

a \ a/ 


The nature of the motion as described by (32-1) is seen from Fig. 19. 
It is an oscillatory motion with the amplitude decreasing exponentially. 

The period of the motion is T - 2 ir/a. 



An electrical problem corresponding to the example of the viscous 
damping of E spring is the following: A condenser (Fig. 20) of capacity C 
is discharged through an inductive coil whose resistance is not negligible. 
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Referring to Sec. 31 and remembering that the IE drop is not negligible, 
we find the voltage equation to be 


or 

Simplifying gives 


V 


L -JK =. 0 

dl 


Q d^Q dQ 

~ + L-i-i-E~ 

C dt^ dt 

d^Q R dQ Q 


= 0 . 
= 0 , 


I— 

L 


and this equation Is of th(‘ same form us that in the mechanical example. 
The mass M corresponds to the induct¬ 
ance Ly r corresponds to the ele(*tricul 
resistance it*, and the stiffness k cor¬ 
responds to 1/C. Its solution is llie 
same as that of tfie corrt‘spoTKling me¬ 
chanical example and is obtained by 
setting 2/> == R/L imd ~ 1/CL. Pig 20 


VWW-' 

R 


PROBLEMS 


1 . The force of 1,000 dynes will stretcli a spring 1 cm A ma?!' of 100 g is suspended 
at the end of tlje spring and set vibrating. Pmd the e{}Ujition of motion and the irequeney 
of vibration if the luasH is |)uljed <lown 2 cm an<l then reU'ascd What a dl be the solution 
if the mass is projected down from rest with a velocity of 10 cm per s(*c? 

2. Two equal masses ate suspended at the end of an elastic spring of stiffness k. One 
mass falls off llcscribe Ihe motion of the remaining nia-ss. 

3. The force of 98,000 dynes extends a spring 2 (an. A mass of 200 g is suspended at 
the end, and the siiring is pulled down 10 cm and released Find the position of the 
mass at any malant / if the re.‘-i.stance of the medium is neglected. 

4. Solve Prob 3 under tlie assumption that the sjiring is viscously damped- It is 
given that the resistance is 2,fK)0 dynes for a vidocity of 1 cm per s(‘c. What must the 
resistance be in order tiiat the motion be a deadbeat? 

6, A coridimser of capacity 4 ^f is cliarged so that the potential difftTonce of the plates 
is i(X) volts The ('ondenser is then discharged through a cod of lesistance 500 ohms 
and inductance 0 5 henry. I'lnd the potential differeiico at any later time L How large 
must the ri'sistance be in order that the discliarge just fails to bi‘ o.scillatory? Determine 
the potential difference for this case. Note that the equation in this case is 



, dV V 

+ %- + c 


0 . 


8. Solve Prob. 5 if i2 « 100 ohms, C =» 0.5 pef, and L *=» 0 001 hen^>^ 

7. A simple pendulum of length I is oscillating through a small angle ^ in a medium 
in which the resistance is proportional to the velocity. Show that the differential equa¬ 


tion of the motion is 


dt^ 


do a 

+ 2.- + ?e 


0. 


Discuss the motion, and show that the period is 2r/Vw* — where •* gfi. 
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8. An iceboat weighing 500 lb is driven by a wind that exerts a force of 25 lb. Five 
pounds of this force is expended in overcoming frictional resistance. What speed will 
this boat acquire at the end of 30 sec if it starts from rest? EifU: The force producing 
the motion is F » 25 — 5 » 20. Hence, 500 dv/di « 20fiF. 

9, A body is set sliding down an inclined plane with an initial velocity of t\> fps. If 
the angle made by the plane with the horizontal is d and the coefficient of friction is 
show that the distance traveled in t sec is 


s «* 3''^(8in B — fA cos 0)1^ -f VqI, 


Hint: m d^s/dt^ « mg sin B — firng cos B, 

XO. One end of an elastic rubber band is fastened at a point F, and the other end 
supports a ina.sR of 10 lb. When the mass is suspended freely, its weight doubles the 
length of the band. If the original length of the band is 1 ft and the weight is dropped 
from the point F, how far will the band extend? W'hal is the equation of motion? 

11. It IS shown in books on strength of materials and elasticity that the deflection of 
a long beam lying on an elastic base, the reaction of which is proportional to the deflec¬ 
tion y, satisfies the differential equation 


El 


dx^ 


^ky 


8et «« kfAEl, and show that the characteristic equation corresponding to the result¬ 
ing differential equation is « 0, wliose roots are m db at. Thus show 

that the general solution is 


y « cos ax -f sin ax -f cos ax + sin ax. 


18. If a long column is subjected to an axial load E and the assumption that the curva¬ 
ture is small is not made, the Bernoulh-hluler law gives (see Sec. 5) 


P 



y 

Fig. 21 


(Py/dx’^ M 
(1 + Wdxpp " El' 

Since the moment M is equal to — Py (h'lg. 21), it follows upon setting 
dy/dx «» p that the differential ecjualion of the deformed central axis is 

pjdp/dy) ^ ^ Py 
(1 ” >;/' 

Solve this differential equation for p, and show that the length of the 
central line is given by the formula 



where Jk® *» d*F/4F/, d is the maximum deflection, and B\k,v/2) is the elliptic integral 
of the first kind (see Eq. (20-12)1. The equation of the elastic curve, in Urn ease, cannot 
be expressed in terms of elementary functions, for the formula for y leads to an elliptic 
integral See, how'ever, Chap. 2, Sec. 10. 

S3. Forced Vibrations. Resonance. In the discussion of Sec. 32, it was 
supposed that the vibrations were free. Thus, in the case of the mechanic>al 
example, it was assumed that the point of support of the spring was station- 
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ary and, in the electrical example, that there was no source of emf placed 
in series with the coil. 

Now, suppose that the point of support of the spring is vibrating in 
accordance with some law which gives the displacement of the top of the 
spring as a function of the time say x « /(O, where x is measured positively 
downward. Just as before, the spring is supposed to be supporting a 
mass Mj which produces an elongation 8 of the spring. If the displace¬ 
ment of the mass M from its position of rest is y, it is clear that when the 
top of the spring is displaced through a distance x, the actual extension 
of the spring is y — x. If the resistance of the medium is neglected, the 
force equation is 

dPy 

M —= Mg - fc(s 4- 2/ ~ -Ky - ^), 
dr 

whereas if the spring is viscously damped, it is 

d^y dy 

M Mg-k{s + y~x) -r-f- 
dr dt 


Upon simplification this last equation becomes 


M 


dt^ 


dy 

+ r = kx, 

dt 


(33-1) 


where x is supposed to bo a known function of t. 

The corresponding electrical example is that of a condenser (Fig. 22) 



which is placed in series with the source of emf and discharges through a coil 
containing inductance and resistance. The voltage equation is 

dl 

dt 

where f(t) is the impressed emf given as a function of L Since 
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the equation becomes 


dV 

C£,~ + Ci?- -f F 
dl^ dt 


-m. 


[chap. 1 
(33-2) 


An interesting case arises when the impressed emf is sinusoidal; for example, 

/(O = Eo sin lot. 


Then the equation takes the form 


RdV 1 
L~dt 


1 

V = — Eo sin 0 L 


Bath (33-1) and (33-2) are nonhornogencous linear eciuations with 
constant coefficients of tlio tyjic 

(^y di/ 

T" + t; + “ 

d(r dt 

The solution of this e((uation is the sum of Ihe cornplemontary function 
and a particular intep;ra]. The eompIcnH*ntur>" function has the form 
(32-2), namely, 

where 

irii =: —6 + and ^2 — —b — . 

A particular intej^ral y == ii(i) can be de(luced for K(p (33-3) for an arbi¬ 
trary continuous function f(0 i>y the method of vanalion of })anua(‘l('rs ’ 
If the impressed force f{t) in (33-3) is simple harmonic of period 27r^ u) 
and amplitude u,), then 

/(/) = Go sin u)t 

and an integral y — u{i) can he obtained by the method of Sec. 25. The 

result is ^ 

o^ao 

_ 26 u) 

where € ~ tan --;• 

— 0)^ 

From discussion in Se<\ 32, it is clear that the part of the general solut ion 
of (33-3) which is due to free vihratioms is a dccrciasing function of t, 
becoming negligibly small after a eufficiont lapse of time. Thus, the '^steady- 
state solution'^ is given by the particular integral (33-4). 

* 8ee the corresponding computation at the end of this section for the case when b m 0. 
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It must be observed that when the impressed frequency o> is very high, 
the amplitude of the sinusoid in (33-4) is small. When w is nearly equal 
to the natural frequency a, the amplitude is nearly a()a/2h. This may be 
dangerously large if the resistance parameter b is small. For a and b 
fixed, the maximum amplitude occurs when 

~{(a2 -0,2)2 + 46V] «0, 

do) 

that is, when 

J = a^~ 262 . ( 33 ^) 

Stated in teims of the physical ({uantitles of electrical and mechanical 
examples, a large amplitude in (33-4) mean.s a large maximum emf, or a 
large maximum displacement of the spring. These, as we have already 
i)oR‘d, may become excessively large whiai the resistance r of the medium is 
.small and the impressed fretpieiu'y w is (‘lose to the natural frequency a. 
I'his phenomenon, known as resonance, is of profound importance in 
numerous engiru^ering and physical situations.^ 

If 6 — 0, Eq. (33-3) reduces to 

d^y 

-- + a^y = a^m. (33-6) 

at 

Wc can easily deduce a formula for an integral y{t) of (33-6) for an nbitrary 
forcing function J{t). Since sin at and cos at are linearly independent 
solutions of Eq. (33-6) witli f{i) — 0, the rn(*thod of variation of param- 
et(Ts (.)f Sec, 28 suggests taking a solution in the form 

y{i) ~ vi{t) cos at + V 2 {t) sin aL (33-7) 

For the determination of vi{t) and V 2 {t) we have a pair of equations [see 
(28-6) iuul (28-10)] 

v'l cos at + V 2 sin at ~ 0, 

~av\ sin at + ai ’2 cos at = 

Solving these for vi and we get 

v\ == —af{t) sina^, i4 = «/(0 cos af, 
which on integration between the limits 0 and i yield 

— a f f[t) sin at dt, V 2 {t) ~ a f f{t) cos at dt, 

Jo Jo 

^ The failure of tlie Tacoma bridge was explained by some authorities on the basis of 
resonant forced vibrations, and there are instances of the collapse of buildings induced 
by the rhythmic sw'aying of dancing couples. The failure of propeller shafts is often 
attributed to forced torsional vibrations. See also Joshua 6:5. 
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Formiila (33-7) then yields 

y{i) « — a cos atj^f(k) sin a\d\ + a sin atj^f(\) cos a\ dX, (33-8) 

in which we have replaced the integration variable / by X so as not to con¬ 
fuse it with the variable t in the limits. It follows directly from (33-8) 
that the integral y(t) corresponds to the initial conditions y(0) ^ i/'(0) « 0, 
If we combine integrals in (33-8), we get the desired formula 

y(t) ^ a f /(X) sin a{t — X) dX. (33-9) 

Jo 

When the forcing function f(J) is taken in the form fit) = ao sin at (so that 
the impressed frequency is equal to the natural frequency), this formula 
yields 

yii) =* ooo f sin a\ sin ait — X) dX. 

Jo 

After simple integration we obtain 


y — — (sin at — at cos at)^ 
2 


representing a vibration whose amplitude increases with time, for the 
amplitude ao/2 in the first term is constant and the amplitude of the 
second term, aQai/2, grows with t. In any physical situation, some resist¬ 
ance is present, and a reference to (33-4) shows that h prevents oscillations 
from becoming arbitrarily large. Nevertheless, they may be dangerously 
large if h is small and a is near w. 


PROBLEM 

Obtain a formula for a particular integral of Eq. (33-3) analogous to (33-9), and deduce 
from it the result (33-4). The integration will hti simplified if sin cd is replaced by 

34. The Euler Column. Rotating Shaft. It is known from experiments 
that a long rectilinear rod subjected to the action of axial compressive 
forces is compressed and retains its initial shape as long as the compressive 
forces do not exceed a certain critical value. Upon gradual increase of the 
compressive load P, a value of P — Pi is reached when the rod buckles 
suddenly and becomes curved. The deflections of rods so compressed 
become extremely sensitive to minute changes of the load and increase 
rapidly with the increase in P. A detailed analysis of this instability or 
buckling phenomenon depends on rather delicate considerations in non¬ 
linear theory of elasticity. However, if the argument of Euler is followed, 
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it is possible to deduce the magnitude of the critical load Pj from linear 
differential equations governing small deflections of loaded rods. 

Thus, consider a rod of unifonn cross section and length compressed 
by the forces P applied to its ends (Fig. 23). Initially this rod is straight, 

ky 



Fig. 23 


but after the critical load Pi is reached, it becomes curved, and we denote 
the deflections of its (;entral line by y. 

It is known from the Bernoulli-Euler law (5-2) that for small deflections 

d^y M 

^ W 


where, in our case, the bending moment M = —Py. Thus 



<Py Py 

~eT 


or 

y" + = 0, 

(34-1) 

where 

III 

(34-2) 

Equation (34-1) must be 

solved subject to the end conditions 



3 

11 

p 

s 

11 

p 

(34-3) 


since the ends of the rod remain on the x axis. 

The boundary-value problem characterized by Eqs. (34-1) and (34-2) is 
quite different from the initial-value problems considered heretofore. In 
the initial-value problems we seek solutions of differential equations 
satisfying specified conditions at one point only, while in the boundary- 
value problem stated above the solution y must satisfy conditions (34-3) 
assigned at hvo points x = 0 and x = i. It is not obvious that a solution 
of a differential equation satisfying specified conditions at two points 
exists in general. We shall see, however, that for suitable choices of the 
parameter k Eq, (34-1) does have solutions vanishing at the end points 
X 0, X =* i. 
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Now the general solution of (34-1) is 

y — Ci cos A:x 4" C 2 sin kx (34-4) 

md, on imposing the conditions (34-3), we get two equations 
0 = Cl cos W) 4* C 2 sin fcO 
0 = Cl cos hi 4“ C 2 sin kl, 

rhese demand that 

Cl « 0, C 2 sin kl — 0. (34-5) 


The choice Ci =5 C 2 = 0 gives 2 / — 0, corresponding to the rectilinear shape 
of the rod. If the rod does not remain straight, C 2 9 ^ 0, and we conclude 
from (34-5) that sin kl = 0, so that 


mr 

= —, 71 = 0,1,2,.... (34-6) 

The choice of n = 0 again gives y — 0. Ifn=l,A: = r/l, and on recalling 
the definition (34-2), wc see that the corresponding value of P is 

Pi-EI— (34-7) 

V 

This is the cnticaL, or the Eulers load. 

The shape of the central line of the rod, in this case, is 


TTX 

y = C2Siny 

The choice of n = 2, 3, ... in (34-6) gives other ‘^critical loads’' P 2 , P 3 . 
... and the corresponding .solutions 

ntrx 

y = C2sm“* 

The maximum deflection C 2 is not detonnined in this analysis, and, indeed, 
no far-reaching conclusions should be made from such calculations inas¬ 
much as they are based on the assumption of small deflections implicit 
in our use of the Bemouili-Euler law. 

Another interesting problem, essentially of the same sort, arises in the 
study of rotating shafts. It has been noted that when a long shaft sup¬ 
ported by bearings at x = 0 and x = Z is allowed to rotate, its initially 
rectilinear shape is preserved only if the speed of rotation 0 does not 
exceed a certain critical value 01 . On approaching the speed 01 the shaft 
starts pulsating and its shape changes. On further increase of the speed 
another critical value 02 is reached when the shaft starts Ideating and its 
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shape changes again, and so on. This phenomenon can, in part, be explained 
by calculations similar to those used in determining the Euler load. 

Let us suppose that the shaft is rotating with the angular speed w. An 
element of length dx of the shaft experiences the centrifugal force 

F dx ^ pdx 

where p is the density per unit length of the shaft and y is the deflection 
at the point x. Thus, 

F » pJ^y (34^) 


is the force per unit length of the shaft distributed along its length. It 
is shown in books on strength of matc^rials that when the forces F acting on 
a rod are normal to its axis, then 


F - 


d^M 

dx^ 


where the bending moment M is given by the Bemoulli-Euler law 


M El 


d^y 

d?* 


(34-9) 


Thus, 



(34-10) 


and if the flexural rigidity El is constant, Eq. (34-10) reads 

d*y __ F 


(34-11) 


The substitution for F from (34-8) gives the desired equation for the 
rotating shaft; 

d^y . 

--k^y^^O (34-12) 


dx^ 


with 


po) 

w 


(34-13) 


Since the roots of the characteristic equation = 0 are rn — db/c, 

m = dtH, tJie general solution of (34-12) is 

y = cie^^ -h + Cz cos kx + C 4 sin kx. (34-14) 

If at the points of support x — Q, x — I the deflection y and the moment 
ill are zero, then [see (34-9)] 

y(0) - 0 , ?/'( 0 ) « 0 , 

2/(0 * 0 , 


2/"(0) « 0, 
2/"(0 - 0 . 


(34-15) 
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The substitution from (34-14) into tlie boundary conditions (34-15) 
yields four equations: 

Cl + C2 + Cg *= 0, 


+ C2 — ca = 0, 

CiC*^ + + C 3 cos kl + Ci sin Id == 0, 

CjC** + C 2 C~‘*^ C 3 cos kl — C 4 sin kl = 0 . 


(34-16) 


The solution Ci ~ C 2 *= C 3 = C 4 *= 0 , yielding y - 0 , corresponds to the 
straight shaft. The system (34-16) also has nonzero solutions for certain 
values of k. From the first two equations (34-16) we find 

Ci = ~C2, C3 = 0, 

and the substitution of these values in the two remaining equations gives 
Cj = C 2 == C 3 = 0 , C 4 sin kl =e 0 . 

Thus, sin W = 0 unless C 4 = 0, and hence 


k = —* n « 1, 2,_ 

I 

Using the value of for n » 1 in (34-13) gives the first critical speed 

jm 

7 Vt* 

The critical speeds W 2 , ^ 3 , ... are determined by taking k with n = 2 ,3,_ 


PROBLEMS 


1. When a beam lies on an elastic foundation, then in addition to the transverse ex¬ 
ternal load F(x), there is a restoring force R =* — 0 ^ 2 / proportional to the deflection y. 
The equation of the axis of the beam then has the form 

+ aV “ F{x). 

Solve this equation for F(x) ■« p, a constant, by assuming that the ends of the beam are 
hinged so that 

y(0) « p"(0) « KO « F'W « 0. 


2. The differential equation of the deflection y of the truss of a suspension bridge has 
the form 


where H ■» horizontal tension in cable under dead load q 
h * tension due to live load p 
E *» Young's modulus 



SYSTEMS or BQDATIONB 


SEC. 35] 


98 


I «• moment of inertia of cross section of truss about horizontaL asds of truss 
through center of gravity of section and perpendicular to direction of length 
of truss 

Solve this equation under the assumption that p — qh/H is a constant. 

3. The differential equation of the buckling of an elastically supported beam under an 
axial load P has the form 




0 , 


where El is the flexural rigidity and k is the modulus of the foundation. Solve this 
equation. 


SYSTEMS OF EQUATIONS 


35. Reduction of Systems to a Single Equation. We saw in Sec. 13 that 
it may prove advantageous to reduce the solution of a second-order equation 
to the solution of a system of two equations of first order. Thus the 
dynamical equation considered in Sec. 31, 


de 


F{s,s',t) 


with s' s ds/dtj can be reduced to a system of two equations, 


— = F(s,t;,0, 

at 



by setting s' = v. 

In the same manner, the third-order equation 


^ =* F(yy,y'\t), ( 35 - 1 ) 

in which y' s dy/di and y" s d^y/d^^ is reducible to a system of three 
first-order equations in Xi, ^ 2 , defined by 


y = y' ^ ^ 2 , y" = 


With these definitions, Eq. (35-1) can be replaced by a ^stem of three 
equations: 

dxi dx2 dxz 

“ =» “jr (35-2) 


This procedure can he extended to nth-order equations. 

A reduction of the nth-order equation to a system of n first-order equa¬ 
tions is of some practical importance in numerical integration of equations 
on differential analyzers and electronic calculators. Such computing de- 
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vicee are usually so designed that it is simpler to calculate n first derivatives 
than one derivative of order n. The reduction has also numerous advan¬ 
tages in theoretical considerations. 

Systems of differential equations appear naturally in problems involving 
dynamical systems with several degrees of freedom. Thus, the motion 
of a particle constrained to move on a surface can be described by two 
positional coordinates^ (^,2/)- These coordinates satisfy equations of the 

form „ 

d X / dx dy \ 


df 

d^y 

df 


( dx dy \ 

‘•’'•lu-iiV- 


This pair of second-order equations can be reduced to a system of four 
first-order equations. 

Alternatively, a system of n first-order equations can usually he re¬ 
duced to a single nth-order equation. A general discussion of this problem 
is involved, and we confine our remarks to systems of linear equations, 
because such systems commonly occur in applications. 

A system of n first-order equations 


dtji 

dt 

df/2 

dt 


= (^nVl + «122/2 H-h Uin2/n + /l W, 


= ^2X2/1 + <^222/2 + --h ^2n2/n + /2(0i 


(35-3) 


dt 


Unl2/l 'h' ^n2V2 -f- * ’ * (^nnyn “4“ 


in which the and the /,(/) are continuous functions of /, is called linear. 
If the/t(/) are all zero, the system is called homogeneous. 

The system (35-S) is linear because the solutions of the associated homogeneous system 
satisfy the iine 4 irity properties slated in 8(‘c. 21. Thus if 

■ ■■, Vn\t) 

and y?\t), yf\l), .... 

are any two solutions of the homogeneou.«5 system, then the set of functions 

Ol/S" + r,y?\ .<:,!,!.» + 

is a solution of the homogeneous system for any choice of the constants c. 


* If a particle moves on a sphere, for example, x and p may be taken as the latitude 
and longitude, respectively. 
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Furthermore, it can be shown that the homogeneous system associated with (35^) has 
a set of n solutions 


!/l» . 

...{/i‘> 

first solution 

v?\ y?\ . 


second solution 



wth solution 

such that the determinant 



. yi" 



vf> 

yf' .. 

. yi?' 

9^0. 


1/)“’ 

vl* . 

. yi."' 1 



The general solution of the system (35-3) is then given by the set of n functions 

Vt ^ -f ozy?^ H-1- + M,(0, i =* 1,2, ,.n, (35-4) 

where yi «* U]{t), V 2 «2(0, •*., Vn * Wn(0 is any solution of the nonhomogeneous 
‘^V^tein and the Ct are arbitrary constants. I’he solution (35-4) is general in the sense 
tiuit the rs can be alw’ays chosen so that there is a unique solution of the system (35-3) 
satislying the arbitralily prescribed initial conditions: 

lh(k)) =« t/jo, yviik) “ 1 / 20 , Vniti}) *« ?/r»0. 

We indicate next how a system of first-order linear equations with con- 
starU coeffLnvfits can ordinarily he reduced to an equivalent single linear 
e(|uafion with constant coefficients whose order is equal to the number of 
efIllations in the system. 

(ainsider the system of two equations 


dx 

— + Oja- + 022/ = /i(0, 

at 

(35-5) 

ay 

— 4- + hy = /2(/). 


We introduce the operator D = d/dl and write (35-5) as 
(D -f Oi)x -f- 022/ = /i(0, 
hx -h (D + h)v = hit)- 
Operating on the second equation in (35-C) with {\/hi){D -f oj), we get 


(35-6) 


(D d- ai)T -}- r (^ + «i)(0 + = r (36-7) 

b, bi 

If we subtract the first equation in (35-6) from (35-7), we get, on multiply¬ 
ing through by !>i, 

(D + o,)(D -f b2)y - bioay = (D -f ai)hit) - hi/i(<)- (35-8) 
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This is a second-order linear differential equation with constant coefficients 
whose right-hand member is a known function. Hence its general solution 
y «= y(t) can readily be obtained. 

The characteristic equation for (35-8) is 

{m + -f" ^ 2 ) — ^ 1^2 ** (35-9) 

and if its roots m «= mi, m ~ m 2 are distinct, the general solution of (35-8) 
is 

y = CiC*”** + + u(0, 

where is a particular integral of (35-8). If (35-8) has a double root 
mi m 2 , the corresponding solution is 

y = + C 2 t^^ + u{t). 

Having obtained y^ we can compute the solution for x, without further 
integration, by substituting y(t) in the second equation in (35-5). Thus 

x{t) = ^ [/2{0 - 62 t /(0 

The procedure for reduction of larger systems or for systems of equations 
of order higher than 1 is similar,^ 

Example 1. Consider 

dx 

“ -h 2x - 
at 


di 


- ar -h J/ 


or {D -h 2)x -2y 

~3x -f (f> + l)y « 

Operate on the second of these equations with +-2) to obtain 
~(i> -f- 2)x -f + 2){l) 4* l)y - -f 2)e\ 
and add tliis result to the first equation. The result is 

3^(D + 2){D 4- l)y - 2j/ » }i{D 4- 2)c« 4* f, 

which simplifies to 

(O® 4- 3D - 4)y «» Sc' 4- 3«, 

This equation can be solved for y as a function of and the result can be substituted in 
the second of the given equations to obtain x. 


^ See Example 2, p. 99. 
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Example 2, Let the two masses Mi and M* be suspended from two springs, as indi¬ 
cated in Fig. 24, and assume that the coefficients of stiffness of the springs are ki and 
Jb 2 , respectively. Denote the displacements of the masses from their positions of equilib¬ 
rium by X and y. Then it can be established that the following equations must hold: 


Mi 


d^y 


-My - x), 




d^x 


x) — kix. 

These equations can be simplified to read 


By setting 


A 

Ml 




ki 

k% 


dp 

' Mi 

Mi 

(Px 

kt 

y 4* 

ki 4- ki 

d? 

Ml 

Ml 

•» c 



« h\ 



Mi ’ 



0 , 


■ 0 . 


Ml 


7U, 



the eejuations reduce to 


(Z)2 -f h^)y ~ b'^x 
'-b^my -f (D® + 4- h^m)x 


0, 

0 . 


Operating on the second of these reduced equations with (l/b^m)(L^ -f b^) and adding 
the result to the first of the equations give 

(D^ + -f a* 4- bhn)x - h^mx =» 0 

or -h (a^ -f 4- h'^m)D'^ 4- aV\x =* 0. 

This is a fourth-order differential equation which can be solved for x as a function of t 
It is readily checked tliat 

X » A sin (<*>t — t) 

is a solution, provided that w is suitably chosen. There will be two positive values of 
a> which will satisfy the conditions. The motion of the spring is a combination of two 
simple harmonic motions of different frequencies. 


PROBLEMS 


Solve the systems: 
dy 


dx 

dt 

dx 

Tt 


dt 

Zx — 2y, 


dt'^ dt 


2 y, 


dt 

dy 


-* 2x - 1/; 


2x- 


2 . 


dx 

Tt " 
(fx 


dy 

d^y 
dt^ 




V, 


x; 


dx 

dt dt 

6. (D 4* l)x 4- (2D 4- l)y « <r', (D - l)x + (D 4* i)y » 1. 

7. Determine the solution in Eicample 1 satisfying y{0) - 0, x(0) «* 0. 
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R. The equations of motion of a particle of mass m are 
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(fx 




d^y 
' dt^ 


r, 


A 
' di^ 


where x, j/, z arc the coordinates of the particle and X, F, Z arc the components of force 
in the directions of the x, y, and z axes, respectively. If the particle moves in the xy 
plane under a central atttactive force, proportional to the distance of the particle from 
the origin, find the differential equations of motion of the particle. 

9, Find the equation of the path of a particle whose coordinates x and y satisfy the 
differential equations 


, r, dy 

m “'r -f He 
di^ di 


Ee, 




0 . 


where //, JS, e, and m are constants. Assume that x ^ y dxfdi ^ dy/dl *» 0 when 
f » 0. This sysU*m of differential equations occurs in the determination of the ratio 
of the charge to the mass of an electron 

10. The currents /j and /a in the two eoui)I(‘d circuits shown in Fig. 25 .sati.siy the 

following differential equations* 

M ^2 


j-A^AA /'—1 r 

o c 
o c 
a G 

1\ 


-VWW'-1 


o, o 
r . o o 7 
^lo 0^2 
o o 




.,d'‘h . , d-h _^^dh /, 


0 , 


^2 Reduce the solution of this system to that 

Fia. 25 of a .single fourtli-oider differential equation. 

Solve the resulting equation under the asj- 
sumption that the resistances Ri and aie negligihh*. 


36. Systems of Linear Equations with Constant Coefficients. Wo have 
indicated in the preceding section how a system of linear eipiations witli 
constant coefTicients can be solved by reducing the pruidein to the solution 
of one equation of higher order. In this section we sketch another mode 
of attack on the problem of solving the homogeneous system 


dy\ 

dt 


+ «122/2 + --h Oln^n) 


dy 2 

—- = a2iyi + 022^2 d-h (^2nyn, 

at 


(36-1) 


d^n 

di 


^nl2/l + 0,n2y2 + --h O^nfiVni 
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with constant coefficients. A third method, based on Laplace transform, 
is given in Appendix B. 

Let us seek our solution in the form 

Viit) == kie^\ y2(t) ^ Vn{t) = (36-2) 

where the constants ki and X are to be determined so that Eqs. (36-1) are 
satisfied identically. 

The substitution from (36-2) in (36-1) yields 

X/tiC^' = {anki + H-h ainkn)e^\ 

Xk2C^^ = (021^^1 + 022k2 -f- * • • 4“ <i2nkn)e^^f 

\kn(^ == (j^nlkl “f* nn2A’2 -f* * * •-j- ^nnkn^C , 

On dividing each equation by and transposing all tenns to one side, w^e 
get the system 

(fill — \)ki + ai2A"2+ • • - 4* Ulnfcn = 0, 

<^2lkl + (U22 ~ X)A-2H-*4- (i2nkn — 0, 

(36-3) 

O^nlkl -4 U„2A^24* * • • 4 (Unn“X)A%, = 0. 

This system is a system of linear homogeneous algebraic equations for 
the unknown ks. It has an obvious solution 

/c^ A 2 • sss kfi 0 

corresponding to the trivial solution 

yi ^ ih ^ Vn == 0, 

Since we are interested in solutions (36-2) which are not all zero, we must 
seek values of the /rs which are not all zero. Now, a system of Eqs. (36-3) 
will have such solutions for the k^ if, and only if, its determinant ^ 

uii — X ai2 • • * Oin 
Cl2l CL22 — X * • * (l2n 

dnl Un2 ’ * ’ Unn X 

The equation D = 0 is called the characteristic equation for the system 
(36-1). On expanding the determinant,, we see that (36-4) is an algebraic 
equation of degree n in X, and thus it has n real or complex roots: 

X = Xi, X »* X 2 , ..X — Xn. 

* See Appendix A, 


= 0. (36-4) 
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If all these roots are distinct, then corresponding to each root X « 
there will be a solution of tlie form (36-2), namely, 

yxit) « y2{t) » k2e^*\ yn(t) = knC^^K (36-6) 

The constants ki in (36-5) must satisfy Eqs. (36-3) with X replaced by X». 

When Eq. (36-4) has multiple roots, the forms of solutions corresponding 
to multiple roots are more complicated. One solution corresponding to a 
multiple root X ~ X» surely has the form (36-5), but there will ^Iso be 
solutions in the form of polynomials in t multiplied ^ by 
To clarify this discussion we consider a simple example. 

Example: Solve the system 

^ -f 3y2, 

at 

( 36 - 0 ) 

dy2 ^ , 

— * 2yi -f yt. 

We take a solution in the form 

yi « 2/2 - ( 36 - 7 ) 

The characteristic equation (36-4) now reads 


On expanding it we get 



3 

1 - X 


0 . 


- 3X - 4 - 0, 


the roots of which are Xi •* — 1, X 2 ** 4. Thus, corresponding to the root Xi « —1, we 
have a Bolution 

2/1 » kie'"^ y2 « (36-8) 

To determine ki and k 2 we form the system (36-3), 


(2 — X)A:i -f" 3it2 “ 0, 

2kx -h (1 ~ X)/f2 « 0, 


(36-9) 


set X » —1, and solve it for the A:s. The result is 


ki »* ’-k2. 

Thus, one of the fas can be chosen at will. If we take ki « a, we see from (36-8) that one 
solution of (36-6) is 

yi «» 0 €~*, 2/2 « (36-10) 

the constant a being arbitrary. 

Another solution is obtained by taking X « 4. It has the form 

Vi * kie*\ 2/2 * he** (36-11) 


^ Heoall the coiresponding situation in Sec. 26. 
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with ka determined by Eqe. (36-9) with X - 4. We find this time 

fca — Hht 

so that, again, one of the ka can be chosen at will If we take ki ■* 6, we obtain a solution 

yi « be^\ ya » Hbe^K (36-12) 

From (35-4) it follows that the general solution of the system (36-6) is obtained by form¬ 
ing a linear combination of solutions (36-12) and (36-10). We thus get the general solu¬ 
tion 

yi « ac*”* 4- be*\ yt ■» —o«~‘ + 3ibe*K 

This solution could have been obtained more easily by the method of Sec. 35. Thus, 
on writing the given system in the form 

(O ~ 2)yi - 3y2 « 0, 

(36-13) 

~2yi -f (D - \)yi « 0, 

we operate on the first equation with ^(D — 1), add the result to the second, and get 
HiD - 1)(D - 2)yi - 2y, « 0. (36-14) 

The corresponding characteristic equation is 

- Dim - 2) - 2 - 0, 

or m* — 3m — 4 * 0. 


Since its roots are mi « — 1, ma »» 4, the general solution of Eq. (36-14) is 

yi « CiS*"* -f C2€**. 

From tlie first of Eqs. (36-13) we have 

yz - mo ~ 2)yi « mo - 2)(rie~* + ca««) - -cie~' -j- ^C8e«. 
This checks the result found previously. 


The main object of this section is not so much to provide a new method 
for solving systems of linear equations but to introduce a few ideas on 
which the important study of stability of solutions of differential equations 
is based. There are several notions of stability of solutions, and we illus¬ 
trate only two such by considering some simple examples. 

The system 

dy 


dt 


« X, 


— = —26x — aV 

Jf j ^ ' 


(36-15) 


is, obviously, equivalent to one second-order equation 




+ 2b 


% 

dt 


-b a^y = 0. 


(36-16) 
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As we saw in Sec, 32, its general solution when 6^ -- > 0 is 

y * ( 3 Q. 17 ) 

If < 0, we can write (36-17) as 

y s= e^^\A cos Vtt^ — t + B sin *- 6^ t). (36-18) 

If b =« a, we have the solution 

ij = e'^^\cy + C2t), (36-19) 

For 6 = 0, we have the equation 

+ a^y = 0, (3G-20) 

whoi^e general solution is 

y A cos at + B sin at. (36-21) 

We observe that if b > 0, the solutions (36-17) and (36-19) are damped. 
That is, |t/(0l — > 0 as ^ ► 00 . If 6 < 0 , these solutions are not damped 

because \y(t) | « for a sequence of \alues / ^ As regards the case 

6 == 0, we see from (36-21) that y{t) os(‘iIIates between -f \/and 
— VA“ -t- [see the lorinula just above (32-l)j. 

If we write E(p (36-16) in the fonn 

(Py _ dy 

—^-\ra y= -2?^/, y' ^ -- (3G-22) 

dr dt 

and compare it with (36-20), we are tempted to say that the solutions of 
(36-22), for small values ot b, can differ only slightly from solutions of 
(36-20), because the right-hand members of these equations arc nearly 
etiual if b is suftlcicntly small. The fact that this is not so is obvious from 
the foregoing remarks conccniing tlic ditferent behaviors of solutions of 
(36-16) for positive and negative values of 6. 

Thus, in general, small changes (or perturbations) in the coefficients of 
a dillerential equation may completely alter the nature of its solutions. 
This remark has an important bearing on the problem of coristmcting 
differential equations that purport to represent the behavior of physical 
systems. In physical problems, the coefficients in a differential equation 
are usually related to phyvsical quantities. Such quantities are determined 
from measurements which are subji‘ct to experinuuital errors. For this 
reason, it is exceedingly important to know just what effect small variations 
in the coefficients of a given equation have on the character of its solutions. 
When small changes in the coefficients result in small changes in the solu¬ 
tions, the solutions are termed stable. 
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Another type of the stability problem occurs in tlie study of the depend¬ 
ence of solutions on small changes in the initial values. In practice one 
ordinarily seeks particular solutions that satisfy specified initial data. 
The initial data are generally deternaned either experimentally or from a 
specific assumption that certain physical conditions hold. (For example, 
one may assume that the deflection of a beam at a given point is zero.) 
If the initial conditions are altered slightly, is it true that the solution of 
a given equation will not be afTeeted by a great deal? The fact that solu¬ 
tions of dift’erential equations need not be continuous functions of initial 
conditions is clear from the following examples. 

Consider the solution of 

dy ^ 

— = ~a^y, a 9^0, (36-23) 

dt 

subject to the initial condition y{G) = The desired solution obviously 
is 

2/(0 = 2/oe”““'^ 

Now, if Kq is changed by a small amount Ai/o the corresponding solution is 

yii) = ( 2/0 + 

Because of the factor 

12/(0 - 2/(0! *^0 as / oo, 

and hence for any e > 0 we can choose a to such that 
12/(0 - 2/(01 < « if / > /o- 
Having chosen /o, we let Ai/ be so small that 

I y(0 2/(01 < « if 0 < / < /o- 

Then it follows that |// ~ 2/1 < « on the whole interval 0 < / < w, and 
lienee the solutions are stable. By (30-5) similar arguments apply to sys¬ 
tems of equations with constant coefficients, and it is found that the sys¬ 
tem (30-1) has stiible solutions when all roots of the characteristic equa¬ 
tion (30-4) have negative real parts. 

On the other hand, if we solve 

dy 

— - a 0, 

at 

subject to the same initial condition y{0) = 2 / 0 , we get 

y(t) - 


(35-24) 
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On replacing |/o by yo + Aj/o we get 

§(<) “ ( 2/0 + Ayo)®"’', 

and this time \yit) — j/(<)| = e“’‘|Ayoi- This becomes infinite aS f -♦ », 
no matter how small Aj/o is, so that the solutions of (36-24) are unstable. 


PROBLEMS 


I. Use the method of this section to obtain the general solution of the system 

dyt 

Vi '^V2, -77 ^ + 1 / 2 - 


dyi 

dt 


2* A system of linear second-order equations 


dt^ 


anVl -t- 0122/2 -1 -h Oinl/n, 


d^y2 

dt^ 


f^iVl + 022^2 ^-h a2nyn, 


d^Vn 

dt^ 


*“ Ofil^l + On22/2 H-- “f Ctnnynt 


where the a,j are constants, is encountered frequently in dynamics. Show by assuming 
solutions in the form 

yt » kx cos (X/ -|- a), t ■■ 1, 2, ..n, 
that one is led to the following characteristic equation for X: 


Oil + 

Oi2 

Oin 

021 

022 + * 

02n 

Onl 

0«2 

■ • 0»n + X^ 


The constants k% are determined from the system of linear equations analogous to (36-3), 
and the constant a remains arbitrary. 

8« Reduce the system of n second-order linear equations with constant coefficients, 


d?2/» 

dt^ 


A- ^ h 

;-l ,-l di 


to a system of 2n first-order equations. 


t « 1, 2, ..., n 
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Although many functions encountered in applications are not elementary, 
virtually every such function may be represented as an infinite series. 

Nonelementary integrals like j (sin dz may be written down by inspec¬ 
tion as a so-called power series, and such series also give a simple, systematic 
method of solving differential equations. Another use of power series is in 
the study of functions of a complex variable z — x + iy] thus, from the 
series for sin x one can ascertain the appropriate definition and the impor¬ 
tant properties of sin z. A type of series known as Fourier series arises when 
one studies the response of a linear S 3 ''stem to a periodic input, for example, 
in circuit analysis, in transmission-line problems, and in the theory of me¬ 
chanical systems. Fourier series and their generalizations are also useful 
for solving the boundary-value problems of mathematical physics. Inas¬ 
much as an indiscriminate use of series may lead to incorrect results, the 
applications presented in this chapter are accompanied by discussion ()f the 
circumstances in which those applications are valid. 

THE GENERAL THEORY 

1. Convergence and Divergence. A series is a sum of terms. Thus, 1 + 3 

+ 5 is a series consisting of three terms, and ai + a 2 H-f- fln is a series 

consisting of n terms. An infinite series is a series 

+ a2 + Ug + • • * + ttn + ' • • (I**!) 

which has infinitely many terms. We shall frequently use the symbol 'Zon 
to denote the series (1-1). 

To get a numerical value for the expression (l-l) we consider the follow¬ 
ing sequence of so-called partial sums of the series, 

51 = ai 

52 *= Ui + 02 

= 01+02 + 03 


== Oi + O2 + 03 +• * •+ On 

in 


(1-2) 
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and examine the limit of the nth partial sum Sn as n oo. If 

lim Sn »» s, (1-3) 


we say that the series converges to the sum s and write 


+ 02 + ^3 + ‘ * * + + *' * • 

If the limit of Sn does not exist, the series (1-1) is said to diverge^ and no 
numerical value is assigned to the series. The precise meaning of the state¬ 
ment (1-3) is that/or aruj preassigned positive mmiher c, however small^ one 
can find a number N such that 


13 — Sn I < € for all n > N. 

To illustrate the definition (1-4) consider the senes 

111 1 . 
1-2 2 3 3-4 «(n-f 1) 

The first three partial sums of (1«5) are 


ai 


2 1 




(1-4) 

(1-5) 


and the nth partial sum is “ n/(7t + 1) (<'f. Prob 1). It is obviovxs that tlie limit 
of Sn as ^ is 1 * 1 / however, we want to prove Uiis fact, we must demonstrate that 
for any preassigned number « > 0 wo can find a number N such that the condition (1-4) 
is satisfied for all partial sums Sn with n > N. In our problem 


IS ~ »nl * 

Given « > 0, we require, then, that 

I 


1 - 


1 


and this is equivalent to 


n + 1 


n + 1 ! w + 1 


< € for n > N 


n + 1 > 


for n > N. 


Hence the choice N » (l/«) — 1 fulfilKs the requirement of the definition. If e « Jio, 
then » 9; if e ** Kooi ^hen N » 99, and ho on. To attain higher accuracy in approxi¬ 
mating the sum of the seiies (1-5) by its ath partial sum Sn. we must, clearly, incri'abe 7 i. 


The number e in (1-4) can be thought of as a measure of error made in 
approximating the sum s by the sum of its first n terms. The actual error 
in the approximation is 


fn «= 5 


(1-6) 
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and the condition (1-4) demands that |rn| < c for all sujBSciently large 
values of n. We shall call Vn the remainder of the series (1-1) after n terms. 
The limit (1-3) may fail to exist either when Sn increases indefinitely with 
n or when the partial sums Sn oscillate without approaching a limit as 
n 00. Thus, the series 

l+l+l+l+“* 

diverges because its nth partial sum Sn - n increases with n without limit, 
while the series 

1 - 1 + 1 -- 1 +••• 

diverges because its partial sums = 1, ^^2 0, 53 = 1, ... oscillate. 

As another example, consider the so-called harmonic series 

1 + H + H + 34 + M + K “t" H + H +' —h i/n H—. (1-7) 
The h^rms of the series (1-7) may be grouped as follows: 

1 + + (K + 3-4) + (34 + K + 34 + 34) + (34 H-1" He) H— *• 

(i>8) 

Now, each term of the foregoing series is at least as large as the correspond- 
mg term of 

34 + H + (34 + H) + (34 + 3''^ + 34 + 3^^ + (3^16 H-b He) “t—. 

(1-9) 

The latter series, however, reduces to 

34 + 34 + 34 + 34 + 34+'*^ 

which is divergent. Hence (1-8) is divergent. 

Tins oxamplp illustraU's the idea of comparison, which is fundamental in the study of 
series. The divergence of (1-8) was established by comparing (1-8) with a simpler series, 
'1-9), whose divergence is obvious. The full chain of reasoning is as follows: “Each term 
of (1-8) 18 at least as great as the corresponding term of (1-9). Hence the partial sums 
of (1-8) are at least as great as the corresponding partial sums of (1-9). But the partial 
sums of (1-9) become arbitrarily large if we take enough terms. Hence the partial sums 
of (1-8) also become arbitrarily large, and the series diverges The student who under¬ 
stands this example w’lll have no difficulty with the more detailed applications which 
follow. 

The use of the criterion (1-4) for convergence of the series (1-1) requires 
knowledge of its sum 5. Frequontly it is possible to infer the existence of a 
limit s without knowing its value. For example, consider the series 

0.1 + 0.01 + 0.001 + ••• 

whose partial sums are = 0.1, 82 = O.l + 0.01 =» 0.11, 53 « 0.1 + 0.01 
+ 0.001 « 0.111, and so on. Each partial sum, being a decimal, is less 
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than 1 . On the other hand the Sn increase with n. If the successive values 
of Sn are plotted as points on a straight line (Fig. 1), the points move to the 
right but never progress as far as the point 1 . It is intuitively clear that 
there must be some point s, at the left of 1, which the numbers Sn approach 
as limit. In this case the numerical value of the limit was not ascertained, 
but its existence has been established with the aid of a Fundamental 

Principle: If an infinite sequence of 
numbers «« satisfies the condition Sn+i 
> Sn for each n, and if Sn < M, where 
M is some fixed number, then Sn has a 
limit that is not greater than M. In 
Fig 1 other words: Every bounded increasing 

seq^ience has a limit. Considering 
— Sn instead of Sn gives a corresponding statement for decreasing sequences. 

From the geometrical interpretation of th(* fundamental principle it 
appears that when an increasing .sequence of partial sums has a limit, 
the difference between the successive values of Sn must tend to zero as 
n 00. Since Sn — Sn-i = u„, the foregoing statement is equivalent to 
the assertion that lirn an — 0. This can be established from the defini- 

n w 

tion ( 1 - 3 ) without appeal to the fundamental principle and without the 
assumption that Sn is increasing. 

Indeed, since 

Un — Sn—-1 (I-IO) 

and since the series converges by hypothesis, we have limsn = lim Sn-i == s 
as n —♦ 00. Hence (1-10) shows that 

lim On = lim 5 


lim Sn^i = 0. 


(Ml) 


We state the result (1-11) as a theorem: 

Theorem I. If a se 7 'ies converges, then the general term must approach 
zero, and hence if the general term does not approach zero, the series diverges. 

The reader is (*autioned that the conver.se of this theorem is not tnie. 
For instance, ih(» harmonic series ( 1 - 7 ) was found to diverge even though 
the general tenn 1/n approaches zero. 

There is a moi-e elaborate version of Theorem I which does have a converse. By 
writing out the sums m full we find a relation analogous to (1-10): 

Ofn + ani41 H-h On ** «n ~ n > m > 1. (1-12) 

If the infinite series converges, so that lim « s, then both the sums on the right of 
(1-12) become arbitrarily close to provided m and n are chosen large enough. Hence 
the right-hand side become.8 arbitrarily small in magnitude, and we are led to the follow¬ 
ing: JfXak converges, then for any e > 0 there ts an N such that 

\am + Ufft+l “b-b ttnl < e 


(1-13) 
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whenever n > m ^ N. Now this statement admits a converse.^ If, for each « > 0, 
there is an N such that (1-13) holds whenever n > m > N, then Xak converges. The 
theorem, together with its converse, constitutes the so-called Cauchy convergence criterion. 
Example 1. A certain series has partial sums » r”, where r is a constant such that 
0 < f < 1. By use of the fundamental principle, show that the series converges to zero. 
We have to show that Urn “ 0 as n —► oo, or in other words 


limr" 


0 


for 0 < r < 1. 


(1-14) 


Since r > 0, it is evident that Sn > 0, and hence the sequence «n i® hounded from below. 
Also -• rr”, or in other words 

®n4-l "* (1-15) 

Since r < 1, this shows that Sn+i < Sn, so that the sequence «« is decreasing. Hence the 
limit of 8„ exists by the fundamental principle. If we write s »» lim Sn and take the 
hmit as n —> «> in (1-15), there results 

8 » lim 8n-fi *“ lim (r«n) *“ r lim s„ * rs. 

From s rs it follows that s *« 0, since r ^ 1, and this gives (1-14). 

Example 2. The geometric series is defined by 

1 “b X -f- -j- 4“ * * * “b "I"' ■ * • 

Show that this series converges to 1/(1 — x) when |x| < 1 but diverges 
when lx| > 1 . 

The geometric series is an example of a series 

Ui(x) + U2(x) + Us(x) H-h Un(x) -f • • * 

in which the terms are functions of x. For each choice of x the function 
Unix) is simply a number, the series becomes a series of constants, and 
hence it can be tested for convergence just as any other series of constants 
is tested. 

We have to decide whether the partial sums 


«n == 1 + X + + • • • + (1-16) 

tend to a limit. If the foregoing equation is multiplied by x, there results 


XSn 


X + x^ H-h x” ^ + x” 


(1-17) 


and subtracting (1-17) from (1-16) yields $n — xs„ = 1 — x”. Solving for 
Sn we get 

1 — x'* 

5n « -- (1-18) 


1 — X 

* Since we shall not require the converse, the proof is not presented here. The inter¬ 
ested reader is referred to 1. S. Sokolnikoff, Advanced Calculus,’’ pp. 11-13, McGraw- 
Hill Book Company. Inc., New York. 1939. 
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[f |x| <1, then lim |:c|’‘ » 0 by (1-14) and hence (1-18) gives 

n « 

, 1-0 1 

hm Sn ~--- 

n~* 00 1 — X 1 — X 

This establishes the required convergence when |:rl < 1. On the other 
hand if |x| > 1, the general term does not approach zero and the series 
diverges Theorem 1 . The value x is called the ratio for the series, sin(*e x 
equals the ratio of two success! ve term. 4 . We have shown that the geometric 
series converges if, and only if, the ratio is less than 1 in magnitude. 

PROBLEMS 

1. Show that the nth partial sum of the series (1-5) is n/(n 4' 1) Since 

l/[n(n 4“ 1)) ** l/n — l/(n 4- )), the sum of tlie first n terms is 

«n -> (H - J^) + (H - 4- (Is - 4- - •' 4- 11/« - l/(n -f D) 

/i series such as this is called a telescoping senes. 

2. Show that the follow!rig scries convergtis to zero if \r\ < 1 but to 1 if r =« 1 Sketch 
the graph of the sum as a function of r: 

r 4- - 0 -f - r^) -f •' + {r^ - 

2. Some Basic Properties of Series. We bhall write infmile b(u*ies in the 
condensed notation 
00 

23 a„ ^ ai 4“ U2 4' ua + *' * + + • * * • (2~1) 

Finite sums are f‘\prc.s.scd similarly, with the limits of summation 
replaced by the appropriate values. The limits of summation are frequently 
omitted if th(\y need not be emphasiztHl or are clear from Ihti context. 
Wienever the limits are omitted in S<‘cs. 2 to 7 of tliis chajiter, the read(*r 
may assume that the summation range is from 1 to co. 

In many respects coxivergent series behave like finite sums. For example, 
if the sum of the series (2-1) is .s and if e^ich term of the series (2-1) is multi¬ 
plied by a constant p, then 

Spa,, = pSa„ ps. (2-2) 

That is, a convergent series may he multijdied termwise by any constant. 

The proof of (2-2) follows at once inun die observation that the partial 
sums Sn of the series 2pa„ are related to the pnrlial .sums Sn of (2-1) by 

Sn P«u 

lim Sn ^ p lim «« = ps. 


and therefore 
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If W6 are given two convergent series Sajk and then 

S(aA ± bk) ~ Sdjfc dr (2-3) 

That is, two convergent series may be added or subtracted term by term. Again 
the proof is simple. We denote the sum of the series Xun by A, that of Xbn 
by J 3 , and the corresponding partial sums by An and Bn- Then the nth 
partial sum of 2)(afc zb bk) is 

n 

(Ufc zb hjc) = An rb Bn 

and the result ( 2 - 3 ) follows on letting n —► qo. 


As an illustration, consider the geometric senes 


By (2<2) 


fl - 1 -f x -f +x« -f 

Ts «* X 4- X® H - h x” 4- 


and hence, by (2-3), we have « — xs =» 1. This shows that if the series converges, it must 
converge to 1/(1 — x). The (|uestion of convergence was discussed in Example 2 of 


Sec. 1, 


Another obvioua but important property is used so often that we state 
it as a theorem: 

Theorem I. If finitely many terms of an infinite series are altered the 
convergence 'is not affected {though^ of course^ the value of the sum may be 
affected). 

To prove this we denote the original terms by Ok and the new terms by 
Ok + bkj where all but a finite number ^ of are zero. The result is then 
a consequence of ( 2 - 3 ). U should be noticed that this argument not only 
establishes coiniTgence but shows that the rawv value of the sum can be 
found by the oliviou.s arithmetical calculation. For instance, if the seventh 
term of a convergiuit .series i.s im*rea.sed b}^ 2A the sum i.s also increased by 
2 .4, and similarly in other cases. 

Example: Establish the divergence of 

M2 4 - he 4 - + ^4 d-. ( 2 - 4 ) 

Multiplying by 4 wc get the scnch 

Ms 4 -K 4 - 35 + 3^6 4 --- 

which la obviously divcrgf^nt, birice it diffcus from the harmonic series Zl/n only in that 
it lacks the first two tcims. Hence (2-4) is divergent. 

* Any finite si'rics 4 - • • • + niay be regarded as an infinite Sf\rie8 with all terms 
beyond the nth equal to -^ero. If we do so regard it, the definition of convergence given 
in Sec. 1 makes the finiW series converge to its ordinary sum. 
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This use of (2-2) to establish divergence is readily justified, even though (2-2) applies 
to convergent series only. Thus, assume that (2-4) converges. The foregoing analysis 
shows, then, that Sl/n would have to converge, and that is a contradiction. 


PROBLEMS 

1* Write the following series in full, without using S notation: 


^ I ^ f 43y A n* -fl A3 
a 2fc’ h V u) ' 2n H- 3’ ^7 j ’ 


(cos x)‘^. 


2. Write the following series in condensed form, using 2 notation: 




M + k^ + H24-M6+*- 


_JL„ ^-1- _j-1-1- 1 -1- 0.2 - 0.02 + 0.002 

1,000 1,002 1,004 1,006 * 


i + _L + _L_+__L_ + . 

1 1-2 1-2-3 1-2-3-4 


Mo + + ^0 + Ho H-• 


3. Some of the series in Probs. 1 and 2 are divergent because the general term does 
not approach zero. Which ones are they? 

4. Some of the series in Probs. 1 and 2 are convergent l>ecause they are geometric 
series with ratio less than 1 in magnitude (or multiples of such a series). Which ones 
are they? 

3. Some of the series in Probs. 1 and 2 are divergent because 2l/n is divergent 
Which ones are they? 

6. Show^ that (1 — 1) -f- (1 — 1) -f (1 — 1) H-converges but would diverge if the 

parentheses were dropped. 

7. (a) Does the series 2 converge? Explain. (6) Docs the 

aeries S [1 - Q"] converge? Explain. Hint In (a) see (1-5). In (6) note that 
^ [;-(!)] . If the given series converges, what could you deduce about 


3. Improper Integrals and the Integral Test. In the development of the 

fh 

calculus a definite integral such as / /(x) dx is defined, at first, only for a 

Ja 

finite interval [a,h]. The extension to an infinite interval is then made by 
a simple passage to the limit; thus 


f f{x) dx = lim [ f{x) dx. 

Ja —» 00 'o 


The integral at the left of (3-1) is called an improper integral. If the limit 
at the right exists, we say that the improper integral converges (to the value 
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of the limit) and it dwerges if the limit does not exist. The definition is 
quite analogous to the corresponding definition 

« n 

53 a* = lim 5Z Ofc 

Jk«.l n • kmml 

for infinite series. 

An example of a divergent improper integral is 

f* - dx » lim / — ** lim (log x 11 ) » lim log b (3-2) 

Jl X Ji X 

On the other hand if p is constant and p 3 ^ 1 , then 

r\dx-]im tx-^dx - hm f) - lim (3-3) 

j\ Si VI — p li/ 1 — p 

The question of convergence now depends on the behavior of b^‘~^ as b —♦ «. If the 
exponent 1 — p is positive, then b'""^ « and the integral (3-3), like (3-2), is divergent. 

But if 1 — p is negative, then p — 1 > 0 and hence 

ffi-p mx —0, as b —♦ po. 

5P~i 

In this case the integral (3-3) converges to the value l/(p — 1 ). 

The result of this discussion may be summarized as follows: 

1 

Theorem I. The improper integral I — dx converges ifj and only if, the 

Si 

constant p > 1 . 

Theorem I suggeste the following analogous result for infinite series: 

GO J 

Theorem II. The infinite series 53 — converges if, and only z/, the canr 

ik-i 

Biant p > 1 . 

It will be seen that Theorem II is valid; in fact, there is a close connec¬ 
tion between infinite series and improper integrals which will now be dis¬ 
cussed. 

Suppose the terms of an infinite series Sa* are positive and decreasing; 
that is, Un > Un+i > 0 for each positive integer n. In this case there is a 
continuous decreasing function /(x) such that ^ 

On = /(w)i = 1, 2, 3, .... (3-4) 

Each term a„ of the series may be thought of as representing the area of a 
rectangle of base unity and height /(n) (see Fig. 2). The sum of the areas 

^ For instance, let the graph of y » f{x) consist of straight-line segments joining the 
points (n,an) and (n -f- 1 , Cn+i)* 
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of the first n circumscribed rectangles is greater than the area under the 
curve from 1 to n + 1, so that 


rn+l 

Oi + 02 H-h On > y fix) dx. ( 3 - 5 ) 

r <30 

This shows that if the integral I f{x) dx diverges, then the sum also 
diverges. 

On the other hand, the sum of the areas of the inscribed rectangles is 



Fi<j. 2 


less than the area under the curve, so 
that 

02 + O3 + ’ ' ‘ “t" fi^) dx. ( 3 - 6 ) 

If the integral converges, we have [since 
fix) > 0] 

/(x) dx < fix) dx s M, 

so that the partial sums are bounded 
independently of n: 

Sn = Ol + «2 H- h- an < M + ai. 


Since each Ok is positive, these partial sums fonn an increasing sequence. 
Hence, the fundamental principle stated in Sec. 1 (‘nsures that is con¬ 
vergent. 

The result of this discussion may be sxjinmarized as follows: 

Theorem III. For x > I let fij') be jX)S 7 tm\ ronUnNoas, and decreasing. 

« /-oo 

Then the series Yl f{n) and the integral / fix) dx both converge or bothdi-- 

^ -/i- 

verge. In either case the partial sums are bon tided as follows: 

r^'fir) dT<j: f{k) < / 7 (x) dx + /(l). ( 3 - 7 ) 

Choosing/(x) == x""^ in Theorem III, we see that Theorem II is a conse¬ 
quence of Theorem 1 . The test for convergence contained in Theorem III 
is commonly called the Cauchy integral lest, thougii it was first discovered 
by Maclaurin. The nvsult ( 3 - 7 ) is especially useful because it enables us 
to estimate the value of the sum. 

Example 1. Show that the series 

_J_ + ^ + _L_ + _JL_+...+__L_+... 

1 ^ 1 •+ 2’ 1 -f- 3* 1 -f- 4“ ^ 1 + n* ^ 

converges to a v^lue which is between 0.7 and 1.3, 
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Here we choose/(x) •• 1/(1 4- ®*). Since 

1 X -1 1 * If 

] IJ ~ ~ j « j, asb «e, 

the integral is convergent, and hence the series is convergent. Moreover 

0.79 « 7 < E r-T-T* < 7 + n “ 1'29 
4 ;t 1 -f- 4 2 


C 


by letting n —»► <» in (3-7) and noting that/(I) ]4. The next example shows how the 

accuracy in such an estimate may la? improved to any extent desired. 

Example 2. Compute the sum of the following series within irO.Ol: 


1 -<1" d— "h —“ H— ~ H- 4 " * 

4 9 16 25 36 


' 4 -~ 4 ‘ 


It is easily verified that the first six terms give the sum L49L To estimate the re¬ 
mainder we have, from (3-7) on taking/(x) =« l/(x 4 6)^ 

r (xTii? ^ ? («+6? ^ +0? ^ 49' 

The two limits in (3-8) are 0 143 and 0 163, as the reader can verify. Hence 

1.634 - 1.491 4 0.143 < x < 1 491 4 0 163 « 1.654. (3-9) 

It is intx'resting to see bow many terms are needed to get the same accuracy by direct 
computation. Tin* remainder aftei n terms is given by (3-7) as 

t r v'^= -■ 

n4 1 

To make this as small as the iineeitainty interval 1 654 — 1 634 obtained in (3-9), we 
must have l/(n 4 1) < 0 02, or n > 19 Thus, direct summation of the series requires 
almost 50 terms for the ae<'uia(\v which we obtained by adding 6 terms only. 


PROBLEMS 

1. Test the following int(*grals for convergence, and evaluate if convergent: 

(lx /*'* dx 


rjf^, n-Mx. r-^, r 
A 1 4 X A A A -r(iog x)® A . 


X logx 


2. Test the following series for convergence: 

1 ^ ^ 1 


'(n 4 1)^ 


1 V' _1_y _ , V 

’ n«2 w(It>g n) /ri *^14 n* 


8. (a) For what values of the constant c does j^e^^dx converge? (b) Using the 

result (a), discu.ss the convergence of (c) Show that the series (b) is a geometric 

series, and also show that your results are consistent wdih those of Sec. 1. 

4. How many terms of the harmonic series are needed to make the sum of those 
terms larger tlian 1,000? 
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do 

6. Estimate the value of by tiu'wt use of Theorem III and also by adding the 

first five term® and using Theorem III to estimate the remainder. In both case® find 
approximately how many terms of the series you would have to add up to get comparable 
accuracy. 

Problem for Review 

6. (o) By (1-18), show that the partial sums of the series 
1+2+^ + ^ -- 

are all less than 2. 

(6) Show that the partial sums of the series 

1 4 . JL 4 - JL 4 - JL ..j—L JL j— 

are also less than 2. Hint' Compare the partial sums with those of the series (o). 

(c) Deduce, by the fundamental principle, that the series (6) converges. 

4. Comparison Term by Term. One way to test a series of positive 
terms for convergence is to compare that series with another whose con¬ 
vergence is known. Let Zon and Zbn be two series with positive terms such 
that On < bn and Xbn converges. The inequality 

n n oc 

s« - Z On < Z ^„ < E *>n 

1 1 1 

shows that the part ial sums s„ are bounded, and since Sn is increasing, the 
limit exists by the fundamental principle. It is left for the student to verify 
also that if (in ^ b^ > 0 and Zbn diverges, then Zon diverges 

This diaciission establishes the following result, known as the comparison 
test: 

Theorem I. If 0 < On < bn^ then the convergence of Zon follows from the 
convergence of Zb^. And if On > ^ 0 , then the divergence of Zon follows 

from the divergence of S6„. 

Since the first few terms of a scries do not affect the convergence, we need the hy¬ 
pothesis not for all n but only for n sufficiently large (see Sec. 2, Theorem I). Similar 
remarks apply to every convergence test, and we shall make constant use of this fact in 
the sequel. 

For example, suppose we want to establish the convergence of 29/n”. Although the 
inequality 

9 1 


is not valid for all n, it is valid when n Is sufficiently large. Hence the series converges by 
comparison with the geometric series. Another example is given by the series 

100 log n' 
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1 _ 1 

100 log n n 

for all n, this is true for all sufficiently large n, and hence the series (4-2) diverges by com¬ 
parison with the harmonic series. 

It is customary to writhe an bn (read **an is asymptotic to 6„’’) if 


an 

lim — =1 

n 

(compare Chap. 1 , Sec. 2). For example, n + 1 n and also 5n^ + 3 n 
+ 4 ^ 5 n^, but it is not the case that 2 /n 1/n even though the difference 

between these quantities tends to zero. In this notation we can state the 
following theorem, which is very useful for determining convergence; 

Theorem II. // an ^ bn and bn > 0 , theti the series Xan and 'Ebn a,re both 
convergent or both divergent. 

The proof is simple. Since lim (an/bn) == 1, we shall have 


1 a« 

- < — < 2 

2 bn 


whenever n is sufficiently large. Equation ( 4 - 3 ) yields 

}>ibn < an < 2bn 


( 4 ^) 


and hence the conclusion follows from Theorem I together with ( 2 - 2 ). 

Example I. Docs " converge? 

For all large n we have log 7i > 2 (since log n Hence 


for all large n, and the senes converges by comparison with the convergent series Zl/n* 
(Theorem II, Sec. 3). 

Example 2, Does Z(n^ 5n -f 3)“^ converge’ 

Inasmuch as -f 5/i -f 3 n*(l + 5/n -|- 3/nO n^, we iiav© 

(n* + 5n + 3)->< ~ = n"* = ~ 

n 

Since 21/n diverges, the given series diverges. 

Example 3. Consider the series 

s ( + 'I"* + 1 

^ V7n’ + 5n* + 8n) ' 


Since n* + 4n* + 1 ~ n\ and since 7n^ + 5n* + 8n 

/_n‘ \H J_ 

W/ “ 7 >‘nW’ 




totic to 


I n , uue geuer24i 




--.r -~jr 


The series with general term 1/n^ converges by Theorem II, Sec. 3, and hence the given 
series also converges. 
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Examples 2 and 3 illustrate two properties of the relation which 
are now set forth explicitly. First, we show that ayiij polynomial is asymp^ 
toiic to its leading term. Indeed, if a 0 and m > 1, then as n 


an^ + hn^ ^ + • • • + rn + ,s' h 

- -^-=« 1 d-1-h 


an" 


an 


an 


tn—1 


+ 


—> 1 . 


an 


This shows that + hif' ^ H- rn + s ^ an'^, as stated. 

Second, if an bn and Cn ^ dm then it follows that 

br 4 i 


for any constants a and fi. To establish this consider the ratio 


\bj \dj 


= 1 . 


PROBLEMS 

1 . Test the following series for convergenee by comparing with the .senes 

V _i_ y_L_ V V _ 

y/Ti 2ri's/ n + 1 + 1)* (2ii + 1)'^ 

2 . Test the following senes for convergence by using I’hi'orctn 11: 

2V - ±2 /'I + - V i: !, 2 -i±i. 

-Y \ ^ 3 n® -f n \n 4 ” -f n* + 4 

3 . Test the following series for *-onvergence by any method; 

1 1 n*^ 

Vg—n* V___j V j V 

’ n log (n -b 1) -f 3 

4 . (a) If On ^ hn and bn show that an c„ (h) If ~ bn Miid Cn ~ dn, is it 

necessary that a„ -f Cn bn -f Prove your an.swer by an (r) Fin<l an 

and bn such that an hn but an — bn (d) Find «n and hn such that an/bn °o 

but an -- bn 0. 

Problem for Review 

6. (a) By direct use of the definition of limit show that 0 . 111111 ...=“ I9. Hint' 
If Si *» 0.1, 82 « 0.11, sz ®= 0.111, ...» then |si — j^l *=* U2 — I * i^00» aud 
SO on. (6) With »n as in (a), and with e > 0 , how large must you choose N to make 

l«n — I < « for ail n > A^? 

(c) If « «* 0.111111evaluate « by considering 10a — a. (d) Evaluate a in (c) by the 
formula for sum of a geometric series. 
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6. Comparison of Ratios. It often happens that the general term of 
an infinite series is complicated whereas the ratio of two successive terms is 
simple. For example, in the series 


we have 



^n-fl 

dn 


^ 2 n -\-2 ^2 

(n + 1)! n + 1 


(5-1) 

(5-2) 


The following theorem cnai)les iia to deduce convergence by considering 
tliis ratio rather than the general term itself: 

Theorem I. Lei Sun and 2)5„ he two series with posHive terms. If 


dfi hji 


1 , 2 , 3 , 


(5^3) 


then the convergence of Xun follows from the convergence of And if 


dji hfi 


n - 1,2,3, . 


(5-3a) 


the7i the divergence of Sun follows from the dicergi nee of S6„. 
The proof is simple, Tn the first case we ha\'e 


02 03 ttn ^ ^2 63 

On = ai-- • • -< ai — - 

0^02 a„__i 6162 


K «i ^ 

- = — 0, 

hn-^l h, 


Hence the convergence of ^hn impli(‘h that ol 2a,, by the comparison test 
(Theorem I, Sec 4). The discussion of (5-3a) is similar. 

If we take = r” in Theorem I, then 26,^ converges whenever r < 1. 
Also 


^n-fl 


"M 


r. 


h 


n 


r 


n 


Hence the theorem shows that converges if there is a fixed number 
r < 1 such that 

^^<r, «=],2,3, .... (5-4) 

a% 


Since the condition 
verges whenever 


(5-4) is needed tmly f<3r large n, the series 2a,t also con* 


lim 

n -+ « 


1 


= r < L 


(5-5) 
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The test based on (5-4) and (5-5) is termed the ratio test To illustrate 
the ratio test consider the series (6-1). By (5-2) we have 

lim-== lim-*= 0 

an n + 1 

and hence (5-5) holds for all x. Thus the series (5-1) converges for all x. 


The ratio test is useful but very crude. It cannot oven establish the convergence of a 
series such as which is rapidly convergent. To obtain a better test one inay use 

the series iSl/n^ for 26„ rather than the geometric series In this case 


tin 


1 

vn + 1)^ 






By the binomial theorem * 


and hence 



P , P (P 4- 1) 
n 


_ j __ P 

bn n 


Since Xb„ converges if p > 1 and diverges if p < 1, we are led to the result stated in 
part (6) of Theorem XL The result (5-5) is stated in part (a) 


Theorem 11. Let Xon be a series of positive termSy and let r ami p be con¬ 
stant. (a) If an-^i/On ^ r, theji Zon converges u'hcn r < 1 and diverges when 
r > 1, (b) If a« 4 i/a„ — 1 —p/n, then 2^a„ converges when p > 1 and 

diverges when p < I, 


Example 1. Does 2n^/2” converge? 
With On » n^/2^ we have 

(a -j" 1)“ 2” 






Hence the series converges by the ratio test, Theorem I la. 
Example 2. Apply the ratio t(jst to the harmonic aeries. 
With On * 1/n we have 

On+i ^ ^ ^ 

a„ n 4- 1 


Since this is the case r « 1, the test gives no information. Moreover, 


On+l _ j ^ n 

an n 4- 1 


1 1 

■■ — —* 

n 4- 1 n 


Since this is the case p «- 1, the more refined test of Theorem lib also gives no infor¬ 
mation.* 


* The binomial theorem for arbitrary exponents is established in Sec. 12. 

* More general tests may be found in I. S. Sokolnikoff, “Advanced Calculus," chap. 7, 
McGraw-Hill Book Company, Inc., New York, 1939. 
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Example 3. For what values of the constant c does the following converge? 

c . c(c + 1) . c(c + l)(c + 2) . 

— -h ' • *. 

11 2! 3! 

For sufficiently large n the terms are of constant sign, and hence Theorem II is ap¬ 
plicable. We have 

an-fi ^ c(c 4- l)(c 4- 2) ... (c 4- n) _ c -f n 

On (n -f 1)1 r(c -h 1) ... (c -f n - 1) n + 1 

Since 

c -f n ^ j ^ c - 1 ^ ^ _ Lzf 

n-bl n-hl n n 

the series is convergent if I — c > 1 and divergent if 1 — c < L Hence, it is convergent 
when r < 0 and divergent when c > 0. In this example Theorem I la gives no informa¬ 
tion hut Theorem 116 solves the problem completely 


PROBLEMS 


1. Determine the convergence by using the ratio test, Theorem IIo: 


1 



n\ 

n* 



"" a! 


2. Show that Theorem Ila gives no information, and tost for convergence by The 
orem 116: 

y ^ V__, V 

^ ^ r(c -|- l)(r 4- 2) ... (r -1- 71 ) 4”(n!)* 

3. Test for convergence by any method: 

y_^_y log ^ V 2 ^ - 1 

^ 1-3-5 ... (2n + 1)'^ n* ’ 3^^ + 4”’ " 2 n + T’ 

4. Give an example of a divergent seriea Zon such that all the terms satisfy > 0 
and a„+i/a„ < 1. Does this contradict the remarks made in connection with (5-4)? 

6. If j IS constant prove that lim T”/n! « 0 . I/ 77 U I'he senes Z ia:|”/a’ converges 

n » 

by the ratio test. 

6. Absolute Convergence. The preceding tests for convergence apply 
to series with positive terms. We shall now see how these tests can be used 
to establish convergence even when the signs of the terms change infinitely 
often.* 

Dpzfinition. a senes Zan is said to be absolutely convergent if the series 
of absolute values w | Un | is convergent, 

' If all but a finite number of terms have the same sign, then we may consider those 
terms only (Sec. 2 , Theorem I). Multiplication by — 1 , if necessary, yields a series with 
positive terms, so that the foregoing methods apply. This fact was used in Example 3 
of the preceding section. 
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For example, the series 2(—is absolutely convergent, since 


S|anf 



S 


converges. On the other hand tlie series is not absolutely con¬ 

vergent, as the reader can verify. ''Idie importance of absolute convergence 
sterns partly from the following theorem: 

Titeorem 1. converges^ then converges. 

In other words, an absolutely convergent series is convergent. The defini¬ 
tion of absolute value yields 


0 < 


On + 
2 



Hence, by the comparison test, the series 

On + lOnl 
2 


converges when ZU/nl converges And then the series with general term 


On — 



converges by (2-2) and (2-3). 

Tc> iliustriite the Uhe of Theorem I coiLsidcr the series 

^ eos nj 


( 6 - 1 ) 


Smee the Mpis chariK^' infinitely often,’ none of the preceding methods is applicable 
\V<* may, liow(‘ver, apply those imdhods to the senes of abbolute values. In view of the 
fu<‘t t,hat 

I (M)S 7}.r I 1 

-—■ - , 

2» — 2’'* 


the series 

is convergent. Hence the original series ((>-1) is eonvergimt. 

A series wln^so terms arc alternately positive and negative is called an 
alternating series. There is a simple test due to Leibniz that establishes 
the C/onvergence of many such series even when the series does not converge 
absolutely. 

^ Kxoept v'hcn x is an integral multiple of 2 t. 


CO.S nx 
2^ . 


cos nx 
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Theohem II. Suppose the aUemaiing series 53 (—is such that 

n—l 

fln > > 0 aruf lim an = 0. Then the series converges^ and the remainder 

after n terms has a value which is between zero and the first term not taken. 

For example, if the sum of the series is approximated by the first five 
terms 

a =3= Sg sr ai — a 2 + 03 — 04 + 05 , 

then the error in that approximation is between zero and —oe: 

0 > s — S 5 > — Ofi. 

The value given by is too largcy because S 5 ends with a positive term, + 05 . 
The value sq is too small, since sg ends with a negative term, and so on. 

To prove the theorem, we have 

«8n (<ll *- 02 ) + (ag — «4) H-h (a2n-l — CtSn) 

« ai — (a2 ~ tta) — • ~ (a2n-2 — 02n~l) — OSn 

and hence S 2 n is positive but kss than cj for all n. Also 

S2 < «4 < .. 

so that these sums tend to a limit by the fundamental principle (S<'c. 1). Since 
« 2 n + 02 n-f 1 and liin a 2 n-i 1 =* 0, it follows that the partial sums of odd order tend to 
this same iimit, and hence the siTies converges. The proof of th(‘ second statement is 
left as an exereisi^ for the reader. Actually Theorem II beconu^ ratla^r obvious when we 
plot the partial sums on the ar axis. 


Since the choice = \/n satisfies the reiiuirements of Theorem II, the 
alternating harmonic scries 


1 1 1 I 1 (-1)^+^ 

«== 1 +-+. 

2 3 4 5 6 n 


( 6 - 2 ) 


is convergent. If the sum is approximated by the first two terms, then 
Theorem II says that the error is between 0 and 3^; that is, 0 < s — 3^' 
< or 

< s < %. ( 6 -^) 


Inasmuch as the series of absolute values diverges, we could not establish 
the convergence by use of Theorem 1. A series such as this, which con¬ 
verges but not absolutely, is said to be conditionally convergent 

By rearranging the order of terms in a conditionally convergent series, one can make 
the resulting series converge any desired value. In illustration of this fact we shall 
rearrange the series (6-2) in such a way that the new sum is jt, though (6-3) shows that 
the original sum is not v. 
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The terms of (6-2) ere obtained by choosing alternately from the series 

1 + H + H + M+--- (6-4) 

and from the series 

(e-5) 

both of which are divergent. To form a aeries that converges to ir, first pick out, in order, 
as many positive terms (6-4) as are needed to make the sum just greater than tt. Then 
pick out, in order, enough negative terms (6-5) so that the sum of all terms so far chosen 
will be just less than ir. Then choose more positive terms until the total sum is just 
greater than ir, and so on. The process is possible because the series (6-4) and (6-5) are di¬ 
vergent; the resulting series converges to x because the error is less than the last term 
taken. 

To get a physical interpretation of this result, suppose we place unit positive charges 
P at the points 

X - 1, -V2, V3, -V4, VS, -VS _ 

and attempt to find the force on a unit negative charge N located at the origin (see Fig. 
3). By Coulomb's law two opfiosite unit charges a distance \/n apart experience an 


•“Vs—V s—Va — V 2 0 VI V 3 Vs V 7 

Fig. 3 


attraction of magnitude 1/n. Since the attraction of charges at the left of N exerts a 
force toward the left whereas attraction of the other charges exerts a force toward the 
right, the total force on N is given formally by the series (6-2). Now, the fact that 
this series is conditionally convergent makes the force dei>end not only on the final con¬ 
figuration of charges but also on the manner in which the charges wore introduced. If 
we obtained the final configuration by putting 10 charges at the left, then 1 at the right, 
then 100 more at the left, and I again at the right, and so on, the net force will be di¬ 
rected toward the left. But if we had a preponderanc,<» of chaigos at the right while 
setting up the final configuration, then the final force would be directed toward the right. 

The foregoing behavior is perha|)8 not very surprising. What is surprising is that a 
rearrangement such as this will always give the same value provided the series in (|ues- 
tion is absolutely convergent. For example, let the configuration consist of unit positive 
charges P at the points x =« 1, —2, 3, —4, 5, —6, ..., so that the force is given by the 
absolutely convergent series 


1 1 1 

1 -i5+p-^+--- + 


(- 1 )" 


In this case, as we shall show, the force does not depend on the way in which the final 
configuration was reached.‘ 

The preceding examples may assist the reader to appreciate the following 
theorem, which describes what is perhaps the most important property of 
absolute convergence. 

Theorem III. The terms of an absolutely convergent series may be re- 
mranged in any manner without altering the value of the sum. 

' One may say that the '^charges at infinity” now have no influence, whereas in the 
former case (6-2) they were important. 
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We establish this result first for series of positive terms. Let Sp* be 
such a series and 2pi a rearrangement. For every n we have 


inasmuch as each term p* is to be found among the terms of Spjk. Hence 
2 plk converges (by the fundamental principle), and also 

Spl < Xpk. 

In just the same way we find 2p* < 2pl, and hence = Zp*. 

To obtain the result for an arbitrary but absolutely convergent series 
Xak, denote the rearrangement by and observe that 


ak « (fl/fc + \akl) - la/cl 
dk « {a'k + (all) - (all. 


(6-6) 


By the result for positive series we have 

Slall =21a,t 

S(al + (all) « S(a/fc + |ajfe|). 

Hence ( 6 - 6 ) gives 2al = Sa* when we recall (2-3). 

By methods quite similar to the foregoing ^ one can establish the follow¬ 
ing, which expresses a third fundamental property of absolutely convergent 
series: 

Theorem IV, If 2a* == a and 26* = b are absolutely convergent^ then 
these series can be multiplied like finite sums and the product series will cm- 
verge to ab. Moreover, the product series is absolutely convergent^ hence may 
he rearranged in any manner. For example, 


ab = aibj + (a 26 i + ^ 1 ^ 2 ) “b “b 0262 + cn^> 3 ) + * • *. 


Example: Consider the series Xx^/\/n. 
With a* x^/y/n we have 

lOn-fll ^ I Vn I 

lonl I Vn 4- 1 I 


|x| 




Hence the series converges absolutely if Ixj <1 and diverges if lx| > 1. To see what 
happens when x •" dbl, we substitute these values into the original series, obtaining 


2 


(~ir 

Vn 



for X » —1 and x -* -f 1, respectively. The first series is conditionally convergent, 
and the second is divergent. Hence the series converges absolutely when [x( < 1, it 
converges conditionally when x ■* — 1, and it diverges for all other values of x. 

^ The proof is given in full in Sokolnikoff, op. dl., pp. 242-244. 
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PROBLEMS 


1. Classify the following series tis absolutely convergent, eonditionolly convergent, or 
divergent: 


' Vn ’ 2n" ’ r? ’ F 3 


3 b 309 


C3rv7 


•f • 


3*6*912 

2. Determine the values of r for wdiich the following series are absolutely convergent, 
conditionally convergent, or divergent: 

S(-l)" --- S(-l)» - f y £(-l)V, 2 U S 2 

n (2n)’ nj” n \j -f 4/ log (n -f 1) 

00 

3. Approximately how many terms of the series ^ ( —I)"/?!** are needed to give the 

1 

sum within lO"**? Kvaluate the sum to two places of df'cirnals 


7. Uniform Convergence. If a finito numlior of functions thnt are all 
continuous in an interval ^ [«,/>] arc added togethfT, the sum is also a con¬ 
tinuous function in [a,6). The questitm arises as to whether or not tliis 
property will he retained in the case of an infinite series of continuous 
functions. Moreover, it is frequently desirable to obtain the derivative 
(or integral) of a function /(.r) i>y means of terTn-})y-term differentiation 
(or integration) of an infinite series that defines/(./). I'nfortunately such 
operations are not always valid, and many important iii\ estigations have 
led to erroneous results solely because of the improper handling of infinite 
series. The analysis of these questions is based on a property known as 

uniform convergence, wliich is now to be descrilied. 

£» 

If a series of functions ^ Un{x) converges for each value of x in an inter- 

n=al 

val [a,6], then the sum defines a function of x, 

s{x) = 2an(x). 

We denote the nth partial sum liy Sn{x), 


Sn(x) == Ui(x) + U2ix) + V:i(x) d-h Unix), 

and the remainder after n terms by rn(,r): 

rni^) = six) - 6V^(:r) M„4.i(.r) + Un^2i^) H-• (7-1) 

Since the series converges to six), lim 5^i(x) = six) as n -h* co, and hence 

lim Tnix) = 0. (7-2) 

The statement embodied in (7-2) m(‘ans that for any preassigned positive 
number 6, however small, one can find a number N such that 

|rn(a:)| < e for all n> N. 


^ We use [a, 6] to indicate the closed interval a < z < 6, 
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It is importaat; to note that, in general, the magnitude of N depends not 
only on the choice of € but also on the value of z. 

This last remark may be clarified by considering the series 


Since 


a; -f (x — 1 )t 4* (aJ - l)x® 4-{- (x — 4-, 

«„(x) « X 4“ (x - l)x 4* (x - l)x* H-4- (x ~ l)x»»“^ « x”, 


it is evident that 

lim 8n(x) s lim x” « 0, if 0 < x < 1, 

n » n -* 00 


Thus, «(x) •* 0 for all values of x in the interval 0 < x < 1, and therefore 
lrn(x)l « |fi„(x) - s(x)i «« lx"" - 01 * x”. 

Hence, the requirement that lr„(x)| < <, for an arbitrary €, will be satisfied only if 
x” < «. This inequality leads to the condition 

n log X < log €. 

Since log x is negative for x between 0 and 1, it follows that it is necessary to liave 

log € 

n > - 

logx 

which clearly shows the dependence of N on both f and x. In fact, if f =» 0.01 and 
X « 0.1, n must be greatt^r than log 0 01/log 0 1 « — 2/( —1) «• 2, so that N can be 
chos^m as any number greater than 2. If € »« 0 01 and x *= 0 5, must be chosen 
larger than log 0.01/log 0.5, which is greater than 6 Since the values of log x approach 
zero as x approaches 1, the ratio log t/log x will increase indefinitely and it will be im¬ 
possible to find a single value of N whi(‘h will serve for e *» 0.01 and for all values of 
X in 0 < X < 1. 


This is the situation which is to be expected in general. In many impor¬ 
tant cases, however, it is possible to find a single, fixed for any preas¬ 
signed positive e, which will serve for all values of x in the interval. The 
series is then said to be uniformly convergent. 

Definitk^n. The series Xun(x) is uniformly convergent in the interval 
[a,h] if for each c > 0 there is a number N, independent of x, ^uch that the 
remainder r„(x) satisfies lrn(x)j < € jor all n > N. 

It is the words in boldface type that give the whole distinction between 
ordinary convergence and uniform convergence. 

To illustrate this distinction in a .specific case, we siiall discuss the geometric series 

so 

53 on the interval — J 2 < 3: < 

According to the result of Sec. 1, Example 2, the sum, partial sum, and remainder 
are, respectively, 

j 1 — X** X** 

«(x) » -» 8n{x) « - -» fnfx) *« -- (7-3) . 

1—X J.-“X 1—“X 
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Tlie oondiUon lr«(ar)l < < gives lx”] < •{! - x) or, upon taking the lo,;arithm and 
solving lor n, 

(7-4) 

log la:I 


Again it appears that the choice of N depends on both x and e, but in this case it is 
possible to choose an N that will serve for all values of a: in f — Hi HI* Given a small €, 
the ratio log «(1 ~ a:)/log |a:| assumes its maximum value when a: *■ +H* Hence if 


N is chosen so that 

AT > _ log < 

logH ” log2 


then the inequality (7-4) will be satisfied for all n > N. 

Upon recalling the conditions for uniform convergence, we see that the series Sx" 
converges uniformly for —H < a: < H- However, the series does not converge uni¬ 
formly in the interval { — 1,1), for, in this interval, the ratio appearing in (7-4) will in¬ 
crease indefinitely as x approaches the values ± 1. 


Generally speaking, any test for convergence becomes a test for uniform 
convergence provided its conditions are satisfied uniformly, that is, inde¬ 
pendently of X. For instance, the ratio tCvSt takes the fonn: If there is a 
number r independent of x such that for all large n 

Unix) 

then ^Unix) converges uniformly. Similarly, the comparison test takes 
the form: If St;n(x) is a uniformly convergent series such that IwnWI < 
Vn{X)f then Swn(^) converges uniformly. The simplest example of a uni- 
fomily convergent series llvnix) is a series of constants. Clioosing such a 
series in the comparison test, we are led to the so-called Weierstrass M test: 

Theorem I. If there is a convergent series of constants, ZMn, such tiiat 
l^'nix )! < Mn for all values of x on [a,hj, then the series Zunix) is uniformly 
(and absolutely) convergent on [a,b]. 

The proof is simple. Since ZMn is convergent, for any prescribed e > 0 
there is an AT such that 


< r < 1, 


Mn^\ + Afn 4-2 + A/n-f-s < e for all n > N. 

By the ordinary comparison test Sr„(x) converges for each x, so that rn(x) 
is well defined. We have, moreover, 

|rn(x)| = lUn+l(x) + U„-(. 2 (x) H-| < |«n+l(x)| + !Mn+3(x)| H- 

< Mn+l + Mn+2 H-- < C 

for all n > iV*. Since N does not depend on x, this establishes the theorem. 
The other tests for uniform convergence mentioned above are established 
similarly. 
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The fact that the Weierstrass test establishes the absolute convergence, 
as well as the uniform convergence, of a series means that it is applicable 
only to series which converge absolutely. There are other tests that are 
not so restricted, but these tests are more complex. It should be empha¬ 
sized that a series may converge uniformly but not absolutely, and vice 
versa. 

^ sin 713J 

To illustrate the use of the M test consider the series ^ —r— Since | sin Tta; 1 <1 

nu>l ^ 

for all values of x, the convergent series 2l/n* will serve as an M series. It follows that 
2(Bin nx)/n* is uniformly and absolutely convergent on every interval, no matter how 
large. 

For another example consider the geometric series Sar”. In any interval [—o,a] with 
0 < a < 1 the series of positive constants 2a" could be used as an M senes, since 
lz"l < a" on the given interval and since 2a" converges 

The importance of uniform convergence rests upon the following 
theorems: 

Theorem II. Let Xuk(x) he a series such that each Uk{x) is a continuous 
function of x in the interval [a,6]. If the series is uniformly convergent in 
[a,5], then the sum of the series is also a continuous function of x in [a,?)]. 

Theorem III. If a series of continuous functions Zunix) converges uni¬ 
formly to 8(x) in [a,?)], then 

/ s(x) dx — Ui(x) dx+ U 2 (x) dx -i -f- / ujx) dx -, 

J<x Ja 'a *a 

where a < a < h and a < ^ < b. Moreover^ the convergence is uniform with 
respect to a and 

Theorem IV. Let Hukix) he a series of differentiable functions that con¬ 
verges to s{x) in [a,6]. If the series converges uniformly in [a,6], then 

it converges to s\x). 

The proof is not difficult, and serves well to illustrate the idea of uniform convergence 
(see words in boldface). In Theorem II, if x and x h are on [a,6], we have 

8{x) « «n(a;) 4* rn(x), 

six + A) ** Snix + A) 4* r„(x + h), 

and hence 

s(x + h) - six) * Snix 4* /l) - Snix) 4- Tnix 4* A) - Tnix). (7-5) 

Given « > 0, pick n so that |rn(01 < < for all t on [afi]. Now, »n(x) is a finite sum of 
continuous functions, hence continuous. Therefore 

\Snix 4-A) -Snix)\ < € 

whenever |A| is sufficiently small. From (7-5) it follows that 

|«(a; 4* S !«»»(* 4* A) — 8«(x)l 4- |rn(x + ^)| 4- \rnix)\ 

< c -f e 4* « . 
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This ehowfi that [ s(x 4* A) — six) | becomes arbitrarily small provided | AI is sufficiently 
rnnall, and hence six) is continuous. 

For Theorem III, note that «(jr) and tnix) artj continuous by Theorem II. Hence 


j six) dx ^ j Snix) dx -4 
hat |rn(jj)| < c 
j j six) dx — j Snix) dx j < j e dx 


fr. 

Jet 


(x) dx. 

If we choose n so large that | r„(x) | < c for all x on (a,^>], then 


|j9 — aj t < (b — a)e. 


Since the finite sum s„(x) can be integrated term by term and aim e (6 ~ a)e is arbitrarily 
small independently of a and d, the desired result follows Theorem IV follows from 
Theorem III when w;t(x) is continuous; * we simply write down the differentiated series 
and integrate term by term. 

A geometric interpretation of uniform convergen(je may be obtained by 

considering the graphs of y = six) 
and of the /ith approximating curves 
// = The condition |r„(x)| 

< e is equivalent to 

s(x) — e < Sn(^) < + € (7-6) 

wliich means that the graph of 
7/ — Sn(x) lies in a strip of width 2c 
centered on the graph of y = s(x) 
isee Fig 4). No matter how narrow 
the strip may be, this condition 
must liold for all sufficiently large n; 
otherwise the convergence is not unifonn. 

With such an interpretation, many facts about uniform convergence 
become rather obvious. For example, the conclusion of Theorem III is 



/ s(a:) dx == lim / Sn{x) dx 

Jot n —♦ 00 


(7-7) 


and the truth of (7-7) is strongly suggested by considering appropriate 
areas in Fig. 4. 

A graphical illustration of nonuniform convergence is given in Fig. 5, 
Here, the partial sums 




n^x 


1 -f n^x!^ 

are plotted for n = 3, 5, and 10. By inspection of (7-8) 
8(x) = lim Sn(x) #« 0, — 00 < X < 00. 


(7-8) 


* A proof free of this restriction is given in K. Knopp, “Theory and Application oi 
Infinite Series/’ p. 343, Blackie k Son, lAd., Glasgow. 
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Nevertheless the approximating curves (7-8) have peaks near x ^ 0 which 
grow higher with increasing n. Since y = Sn(/) does not lie in a strip 
--€< y < e for arbitrarily small ^ e and all large n, the convergence is not 
umlorm in any interval containing tho point x = 0. 

By looking at Fig 5 one cannot (‘asily sec whether the artms under the 
curveij y - Sn(x) tend to 0 or not; that is, one cannot tell whether (7-7) 



holds or not A short ca]<‘ulation l)as(‘d on (7-(S) sIutvs that, in fact, (7-7) 
does hold Thus, th(' con<lusi(»n of HuonMU III m<i\ be true even wlicn 
the c()n\('rg<mce is not unitoiin It i-« left jor tlu' sfuthiit to verify tliat 
(7-7) does not liold when a ~ 0, d - 1. and, instead of (7-8), 


6r,(x) = " 


1 T //“j " log Af 


(7-9) 


The graplis of y = 6„('r) in (7-9) give a iigiiri' (]Uite similar to Fig. 5. 


PROBLEMS 

1. (’h(‘ piitial sum- of I senos au s/r) ^ r" Slu'W tint Iho senes is uniformly 
(onv<T};uil in tlip int( ivy'll [(), ^'2! 

2. By usm^ tl)e dc^hnitiou of unifonu joiucir^ nn*, slum that 

I _ 1 _ _ _ 

j -f- 1 (x + l)(.r -f 2) (j- -f- a — DC-r -f n) 

* In this caw the condition do(*s not even hold for laigc vuUkh of €. 
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Is umformly convergent in the interval 0 < 1, Hint: Rewrite the series to show that 

9%{x) «» l/(ar + n) and therefore »«(z) — «(*) *« l/(i» -f «). See Prob. 1, Sec. 1. 

3 . Test the following series for uniform convergence: 

S SdOx)". 2n(8m x)». X 


4 . Test for uniform convergence the series obtained by term-by-term differentiation 
of the four series given in Prob. 3. 

5. Plot the sequence Snix) «■ nx/(l -f- nx) versus x for 0 < x < 1 and for n 10, 
100, 1,000. Does lim Sn(x) «» «(x) exist for every x? Is the convergence uniform on 


0 ^x < 1? 


Is «(x) continuous? Does lim 


j Snix) dx 



dx for all a, 0 on [0,1]? 


6, If Sn(x) ■» 2iixc”~"**, 0 < X < 1, show that 

lim / 8n(x) dx -- I lim 8n{x) dx » 1. 

n -* 'K> Jo Jo n » 


Is the convergence Sn(a:) -♦ «(x) uniform? 


Problem for Review 

7. Show that converges absolutely if lim | On | ** r < 1. Hint: Choose r' so 
that r < f' < 1. Then V^lttnl <^' for sufficiently large n, and hence |a«l < (r')". 


POWER SERIES AND TAYLOR’S FORMULA 


8. Properties of Power Series. One of the most important types of 
infinite series is the power series ^ 

00 

a„x" = 00 + 01* + 02** H-f- a„x” - ( 8 - 1 ) 

AasrO 

SO called because it is arranged in ascending powers of the variable. Typi¬ 
cal examples are given by the three series ^ 

Sx^nl,-» Sx", (8-2) 

nf 


which were already encountered in the foregoing sections. 

For many power series the region of convergence is easily determined by 
means of the ratio test. In the first series (8-2), for instance, the ratio of 
two successive terms leads to 


x”n! 


- 1)1 


\xn\ « \x\n 


for X 5 *^ 0 


^ Throughout Secs. 8 to 14, 2 means X rather than 

0 i 

* It is customary to take 0! «» 1, so that the relation n! *■ n(n — 1)1 will hold for 
•w 1 as well as for n » 2, 3, 4, .... 
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and hence the series converges only for a: = 0. In just the same way it is 
found that the second series gives a ratio lx|/n, which approaches zero. 
Hence the second scries converges for ail x. The third series is the geo¬ 
metric series, which, as we know, converges for |x| <1. 

It is a remarkable fact that every power series, without exception, be¬ 
haves like one of these three examples. The series converges for x « 0 
only, or it converges for all .r, or there is a number r such that * the series 
converges whenever | x | < r hut diverges whenever | x | > r. The number 
r is called the radius of convergence, and the interval | x [ < r is called the 
interval of convergence. The fact that every power series has an interval of 
convergence may be deduced ^ from the following theorem: 

Theorem I. If Sunx’* converges for a particular value x = Xo, then the 
series converges absolutely whenever |x| < |xo! and uniformly in the interval 
i^l < \^i\ for each fixed Xi such that |xi| < |xo| And if it diverges for 
X = Xo, then it diverges for all x such that |a'| > )xo|. 

To establish Theorem I, observ'e that liin UnxS = 0, since XanXQ con¬ 
verges (Sec. 1, Theorem 1). Hence (0,^x21 <1 for all sufficiently large n, or 


1 

junl <- for all n > N, say. 

lA)i" 


(8-3) 


This shows that 2|an| l.r|'^ converges by comparison with the geometric 
series 



provided |x| < |xol. The statement concerning uniform convergence is 
established by the saiiK' calculation, since w(l.rjl 'lxo|)'‘ serves as an M 
series for the Wenerst ras^ M test. Finally, th(‘ statement concerning diver¬ 
gence follows from the lesult on convergence. That is, if the series con¬ 
verged for X, it would have to converge for X(>, since |xo| < |x|, and this 
is contrary to the hypothesis 

The uniform convergence mentioned in Theorem I shows that a power 
series represents a conlimioiKs fuiuaion for all values of x interior to its 
interval of convergence (see Theorem II, 8e(‘. 7) For instance, 2x” = 
1/(1 — x) is (‘ontimious for Ixj < 1, though not at x = 1. We shall soon 
see that such functions not only are conlinuoiis but have derivatives of all 
orders and the derivatives can be found by termwise differentiation of 
the series. 

* For simplicity of nomeucliiturc one may incorporate the first two cases into the third 
by allowing r ~ 0 and r « oo. The case r « 0 arises when the scries converges for x « 0 
only, whereas r « « if the series converges for all x. 

* A complete discussion is given in Sec. 16. 
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As m illustration of this fact consider the geometric series mentioned 
above. Term-by-term differentiation yields the series Because 

of the coefficient n, which tends to infinity, one might expect the latter 
series to have a smaller interval of convergence than the former. Actually, 
however, the intervals are the same. Since 



1 

1 n 

(n - 


X 

1 n — 1 


IX1, as n —> 00 , 


the ratio test shows that the differentiated series, like the original series, 
has the interval of convergence |x| < 1. A similar result is found if we 
differentiate repeatedly. Each differentiation multiplies the ratio by 
n/(n — 1). Inasmuch as n/(n — 1) —> 1, thLs factor does not change the 
limit of the ratio, hence dcKis not change tlie interval of convergence. 

For many power series the ratio |a„^_i/a„| has no limit as n c©, and 
the foregoing analysis does not apply. However, suppose the series (S-l) 
converges for some value x — xo 5 *^ 0, so that, as before, we have the esti¬ 
mate (8-3). If |x| < |xo|, the differentiated series 2^na„x”'~^ converges 
by comparison with 



(Note that the latter series was shown to be convergent in the previous 
paragraph.) The same calculation establishes uniform convergence of the 
derivative series if larj < lxi| < |xo|, since 




n 



serves as an AT series for the Weierstrass M test. Hence, the result of the 
differentiation is actually the derivative of the original series 2a„x’’ (see 
Theorem IV, Sec. 7). 

The foregoing argument is practically identical with that used to prove 
Theorem I. A third use of the same method establishes the corresponding 
result for the integrated series llanX^'^^/{n + 1). In this case the compari¬ 
son series are, respectively, 


, 1 /i^iy 

n + 1 \|xo|/ 


and 


n + 1 \|xo|/ 


Summarizing this discussion we can state the following, which is perhaps 
the most important and useful result in the whole lheor>' of powr3r series: 

Theoeem II. A power series may be differentiated (or integrated) Urm 
by term in any interval iu^terior to its interval of convergence. The resulting 
series has the same interval of convergence as the original series and represents 
the derivative (or integral) of the function to which the original series converges. 
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Consider, for example, the geometric eeries 

(I - x)~^ “ 1 + X + X* -+ *"+•••, |x| < 1. (8-4) 

Differentiating termwise we obtain 

(1 ~ «« 1 + 2 x 4 - H-h nx^~^ -f • • •, kl < 1 . ( 8 - 5 ) 

Differentiating again gives an expansion for (1 — r) and so on Since the series (S-4) 
converges for l^j < !, Tlieorem 11 shows unthoui furthr diacunsion that all these othciT 
expansions are also valid for jj:’! <1 

On the other hand, if the series (8-4) is integraterl termwise from zero to j, there results 
an expansion 

-log (1 - x) « a: 4- " + - -I-h-1-, |j| <1, (8-6) 

z o n 

which can l>e used for numerical compulation of the logarithm. 

h]quations (S-4) to (8-t>) giv-^e iiowcr-seriivs n^prescntations for the func¬ 
tions on the left. It will now be ehttilili.shed that such representations arc 
always uni<iue 

4'hiookem III. If hw poiver serus conv(rg( to the same sum throughout 
an irtieroal, th(n correspotnbng coi jfu ients are (gnai 

For proof, assume that so that, by (2-3), 

0 — [do — bo) + (fli — bi)x -f* f«2 bj).r -j- ‘ • -j- (un hn)x*' + • * '. 

The clioice x -= 0 yitdds uo — ho i-liffen'iitialing with n^speci to .r yields 

0 = (ai - hi) 4- 2(^2 b2)x 4-b n(a,^ - "^4- 


and if we now set x - 0, wo get Oi ~ /q. Fp<>^^ differentlaling again and 

setting X - 0, ^s'(' get ~ bj, and so on. 

This process not only shows that the coetlicHuits are unujiiely determined 
but yields a simjde formula for their values Tx)t 

/(.r) = do + aix + ( 22 ^^“ 4-.. 4- 4-, for |xj < xq. 


Upon dilTereutialing n times wo g(‘t 

- 0 + 0 -f 0 H--f 0 f // !«„ 4- 

where the second group of terms '"4— ■” involves x, x^, or higher powers. 
These terms disappear when we set x - 0, and hence= nldny or 


j(rO(0) 


(8-7) 


In the following section vve shall be led to the same formula (8-7), though 
by an entirely dilTereut method 

The algebraic properties described in Sec. 2 for series in general give 
corresponding properties for power series: Two power series may be added 
term by term, a power series may be multiplied by a constant, and so on. 
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Since power series converge absolutely in the interval of convergence, The¬ 
orem IV, Sec. 6, yields the following additional property: 

Theorem IV. Two power series may be multiplied like polynomials for 
values X which are interior to both intervals of convergence. Thus, 


(XanXmXbnX^) = SCnX", 


where C„ «= Oofen + + U 2 ^n --2 + * * * + On^O- 


So far, nothing has been said about the behavior of power series at the ends of the 
interval of convergence. As a matter of fact all behaviors are possible. For example, 
each of the series 





00 

z 

1 


n 


has |x| < 1 as interval of convergence. However, the first series converges at a: « 1 
and ““1, the second diverges at x « 1 and —1, the third converges at x » —1 but 
diverges at x « 1, and the fourth diverges at x « —1 but converges at x « 1. 

For applications, the most important theorem concerning the behavior at the ends 
of the convergence interval is Abel’s theorem ' on continuity of power series, which 
reads as follows: 

Ab|}x<’s Theorem. Suppose the power series Xonx" converges for x xo, where xo may 
be an end point of the interval of convergence. Then 

lim SOnX” « Xonx" 

* -^*0 


provided x —► Xo through values interior to the interval of convergence 
To illustrate the theorem, lot x —> —1 through values greater than —1 in the series 
(8-fi), The limit of the left side is — log 2, since the logarithm is continuous, and the 


limit of the right side is ^ 
1 


(-ir 


by virtue of Abel’s theorem. Hence, 


log 2 


1 i 1 

2 3 4 n 


As another example of Abel’s theorem, let x —♦ 1 in Theorem IV to obtain the fol¬ 
lowing: // 


Cn Uchn + Ol^n-l H-h Un^O, 


then (Zan)(^hn) •* Xcn provided each series is convergent. Hence, with the particular 
arrangement of the product series which is given by Xcn, we do not need absolute con¬ 
vergence as in Theorem IV of Sec. 6. 


PROBLEMS 


1. Find the interval of convergence, and determine the behavior at the end points 
of the interval: 


3.2n 5.2n4-l 



* A proof is given in I. S. Sokolmkoff, '‘Advanced Calculus,” pp. 278-279, McGraw- 
Hill Book Company, Inc., New York, 1939. 
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2. Show that the radius of convergence of is given by r *■ lim lon/On+il 

ft «0 

whenever this limit exists. 

3. (a) By letting x » in (S-4), obtain the expansion 

—i-r-l - <2 4.... 

1 4 - < 


(b) By integrating from zero to x, obtain an expansion for tan 
suit (b), show that 



(c) Using your re- 


4 . (a) Show that the series 1 / =** satisfies y' « y. (h) Deduce an expansion for 

e^. For what values of x is this expansion vahd’^ fc) Obtain series exj)an.sioris for e and 
1/e by taking x « =hl in (6) (d) Using youi seia-,, compute e and l/e to three signifi¬ 

cant figures, and clieck your vvork by finding the pioduct c(l/f’). 

6. Using results given in the text, express the following integials as power series: 


r.A , r 

Jo l -f Jo 


log (1 4 U) df, 


i 


t (It 

0 (1- tV 


Hint' In the third casi*, for example, let x - in (8-5), multiply through by f®, and 
finally int(‘grate term by term 

6. By multiplication of senes obtain the expaiiMon of (1 4“ x 4- 4~ • • * + a;” 4- * * *)^' 

In particular, compute the coidficieiits of 1, x, x^ x^, and x" in the product series. 


9. Taylor’s Formula. The usefulness oi power series is greatly increased 
by the so-called Taylor formula, which yiMa the [)owor-senos expansion for 
an arbitrary fund ion f{x) together with an expression for the remainder 
after n terms Let /(.r) be a function with a coritiimous nth derivative 
throughout the interval Taylor’s formula is obtained by integrating 

this nth derivative n times in succession between the limits a and x, where 
X IS any point on [a,b]. Thus, 


f fA'\x) (dxf = dx dx 

= A-^\z) - (r - a)/<"-'>(o) 

r r -/<"-®’(a) - (x- 

Ja Jq Ja 

- (a) 

2! 


(a-- a)"-‘ 


(« - 1)! 
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Solviiig for/(x) gives 2 

/(X) » m + {x- a)m + - ^ -/»(a) 

Jtl 

(x ~~ 

+ • • ’ + (94) 

(7i — 1)! 

where -Rn •= / ... f (9-2) 

The formula given by (9-1) is know as Taylor’s formula, and the particu¬ 
lar form of Rn given in (9-2) is called the integral forvi of the remainder after 
n terms. The Lagrangian form of the remainder, which is often more use¬ 
ful, is 

(x — a)" , , 

Rn = ^^a < f < X. (9-3) 

n\ 


To derive this from the form (9-2), let M be the maximum and m the minimum of 
for a < < < I, Then the integral (9-2) clearly lies between 

f ... f M (cb-)" and f ... f m (dr)". 

J a J d J a J Q 

Upon carrying out the integration we find that these bounds are 


(x ~ aY 


M and 


(T - O)” 


* m, 


respectively. Since the continuous function/^"^(O assumes all values between its maxi¬ 
mum M and minimum w, there must be a number / ^ such that (9-3) holds We 

have \>sntten our ine(|ualities for the ease a < x; in any case, ^ is bet\^eerl a and x. 


In general the remainder Rn depends on x, as is obvious from the repn'- 
sentation (9-2). It may haj)])eri, however, that f{x) has derivatives of all 
orders and that the remainder R^ approaches zero as n oo for each 
value X on [«,?>]• lii this case ve obtain a representation for f(x) as an in¬ 
finite series 


fix) 


£. /<">(a)(x - g)" 

n ^ * 


(9-4) 


and Rn now gives the error which arises when the series is approximated by 
its nth partial sum. The series in (9-4) is called the Taylor scries for f{x) 
about the point x = o. The special case 


/(X) - 


.^/‘">(0)x" 

„-0 n! 


(9-5) 


is oft,en called Maclaurin's series, though Taylor’s work preceded Mac- 
laurin’s. 
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To ilhistrato the iiw of Taylor’s formula, lot /(jr) Then f'(x) 

*atul honec/^'^^O) «> 3 E<|uation (0-5) 8ug:t?oKts that 


gjr 


x” 


145 

nx) * 

(0-0) 


and indeed, bv the ratio test this series converges for all x. However, to show that it 
converges to c® w'e must consider the remainder, wliich takes the form 


n! 


(9-7) 


wh<‘n wo use (9-3). Since this approaches zero ^ as n oo, the siTies dof*s converge to c®. 

As anot her example we find the expansion of cos x in powin’s of x - {ir/2). The values 
of /, .r, f\ /'" are, rc.sjM‘C‘tively, 


cos X, —sin X, —cos x, sin x. (9-8) 

Sin((' tli<- next term is/’'" » cosx, the next four derivatives r<'p(‘at the se(ju(‘nee (9-8), 
tlie iH'Xt four repeat again, and so on 1 !»'aluatiiig at x — 2 we get, respectively, 

0 , - 1 , 0 , 1 , 0 , - 1 , 0 , 1 ; 0 , - 1 , 0 , 1 , . . 


and hence Kip (9-4) suggests tlie expansion 




(9-il) 


To determine if the scales coiivmges to the lunction on the hdt, w(‘ consider the re- 
HiHind('r after n {(ams Now, (0-8) gi\es/''‘Xx) — -b •'in x or rf cosx, so that (9-3) 
implies ' l\n \ <\J — IT a’ Since hm /iN = 0, the ex])ansion (9-9) is valid 

Upon setting X - Tr/2 ~ t and noting (hat ros(7r 2 — /) — mu/, w'e get an ex'pan- 


sion lor sill t 


sin t ^ I 


/•’ 

— -f- — — T 

3’ :d 


. 4- ^ ^ - - 4-. 

(2a f 3)’ 


(9-10/ 


which is consistimt with (9-5) It is Udt as an exercise lor the ri'ader to obtain a similar 
expansion for the cosim* by um‘ of (9-5) and (9-3). 


cos X = I 



+ 


(~1)V“^‘ 
C2a)» " 


(9-11) 


In these examples the fact that the series converges to the function was 
established by direct examination of R„. Such examination is necessary 
ev('ii vdien the series is found to be convergent by other means. For ex- 
arajile, if wo. define 

fix) = /(O) = 0, 


it (‘an be shovn that the Taylor series about x = 0 (*onvorges for all x but 
converges to/(.T) only when x = 0. The trouble with this function is that 
it does not admit any power-series expansion valid over an interval con¬ 
taining a; - 0, and we have the follotving: 

‘ The fact that (9-6) converges shows that z^/n\ 0. (Cf. Prob. 5, Sec. 6.) 
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Theorem I. Suppose a function f(x) admits a series representation in 
powers of X -- Uy so thatf(x) » Xan(x — a)^ for some interval (x — a| < c. 
Then the Taylor series generated by f{x) coincides with the given expansion 
(and hence, the Taylor series converges to /(x)]. 

For proof, differentiate ^ n times and set x = a, just as in the discussion 
of (8-7). It will be found that an == f^'^\a)/n\, and hence the given series 
is identical with the Taylor series. 

Theorem I shows that a valid power-series expansion obtained by any 
method whatever must coincide with the Taylor series. For instance, to 
find the Taylor series for sin x^ about x = 0, we set < = x^ in (9-10). This 
is far simpler than direct use of Taylor's formula, as the reader can verify. 


Example: Obtain the expansion of 


in powers of x 
With t « X 


1 . 


m 


1 

(x - 2)(x - 3) 


1, the given function becomes 


1 -1 1 111 
(i - 1)(< - 2) “ « - 1 - 2 ” 1 - t 21 - 'At 

when we use partial fractions and the known formula for sum of a geometric series. 
Upon recalling that i ** x — 1, we get the required result 

Since the two geometric series (9-12) converge for ji] <1 and \t\ <2, respectively, 
the expansion (9-13) is valid for |x — 11 < 1. By Theorem I, this expansion coincides 
with the Taylor series. 

PROBLEMS 


1 . For the following functions find the Taylor scries about the point x •* 0 and also 
about the point x « 1; 

sin TTX, cos (x — 1), 2 4- x’, (x 2)“"^. 

2 . (a) Expand c* about the point x « o by writing c* » and using (9-6). 

(6) Expand logx about x 1 by writing logx « log [1 — (1 x)] and using (8-6). 
(c) Obtain the general Taylor series from Maclaurin’s. Hint' If g(t) =- f{a -j- i), then 

/(a -f 0 - git) - 


Now let i * X ~ a. 

‘ The fact that the series now considered are in powers of x — a rather than x causes 
no trouble. By a simple translation of axes, f » x — a, these series become power series 
of the type considered in the preceding section, hence are subject to the theorems of the 
preceding section. 
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3. Expand the following fractions about the point » 1: 

1 1 _^_ 

z x^ — A x{z^ — 4 ) 

4. Show that the Taylor series for sin x in powers of i — a converges to sin x for every 
value of X and o, and find the cxjMinsion of sin x in powers of a; — tt/G. 

6. Obtain the Maclaurin aeries for cos x by differentiating the series for sin z, 

6. By means of the known senes for c“, sin u, and log (1 — w), find Taylor's expansions 
for 

sin x*, c* + c"* log (1 + x^). 

7. What is the Taylor series for (1 + x)^ if p is constant? Find the interval of con¬ 
vergence, and discuss the absolute convergence at the end points of the interval Hint: 
Use Theorem II6, Sec. 5. Analysis of the remainder Hn is difficult and may be omitted, 
A proof that the series converges to (1 -f will be found in Sec, 12. 


10. The Expression of Integrals as Infinite Series. Many difficult in¬ 
tegrals can be represented as power series. For example, if we let x ^ 
in the series (9-11) for cos x, we get 


cos r 


1 -+ - + ■ 

21 4! 


and hence, integrating term by term, 

^5 


/: 


cos dl - X 


5-21 


+ ‘ 


(_l)n^4n 

•H-+ ■ 

(2n)! 


(_^|)n^4n+l 

^ (in + ]) (2n)! 


( 10 - 1 ) 


This integral is calh^d the Fresnel cosine integral; it is important in the theory 
of diffraction. Although the Fresnel integral is not expressible in terms of 
elementary functions in closed form, tlie expansion (10-1) is valid for all x 
and gives a representation which is entirely adequate for many purposes 
Sometimes one may obtain a power series involving a parameter rather 
than the variable of integration as in the last example. To illustrate this 
possibility w e shall express the arc length of an ellipse as a poorer series in 
the eccentricity /c. If the equation of the ellipse is given in parametric form 
as 

X = a sin y ^ h cos a > 6, 


then the arc s satisfies 


= dx^ + dy^ — (a^ cos^ B + \)^ sin^ B) dB^, 

Upon noting that cos^ 0=1— sin^ 0, we obtain 
ds = o a/i — /c2 si 11 ^ 0 dB, 

where k ^ (a^ -- IF) ^/a is the eccentricity. Hence, the arc from 0 « 0 to 
0 «= 4>is 


« == a r V^l — B dB s aE(k,<t>). 

Jo 
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The integral defined by this equation is called the elliptic integral of 

the mond kind. Although is not elementary, it may be expressed as 

a power series. 

By the binomial theorem (Sec. 12) 

(1 - sin^ e)^ ^ I sin^ B - )4kf sin^ B - (10-2) 

for k^ < 1, which is the case when h ^ 0. Since 
1 + }^k^ + ygk* -f.. •. 

serves as an M series, (10-2) is uniformly convergent and term-by-term 
integration is permissible. Hence we obtain the desired expression 

1 

E(k,i>) ^ 4 ) -siu'^ B do -/ sin'^ 6 dd ~~ • ♦ • 

2 Jo 


1-3-5 ... (2n - 3) 
2-4-6 ... 2n 



In a similar manner it can l>e shown that the elliptic inkyrul of the Jirnl hind (id 
Example 3, Sec. 20, Chap 1), 

r de 

V 1 — 8in^ 0 

has the expansion 

1 1-3 

f'Xk.th) 4> d' I sm^edd d -- A:^ / am^ B do H- 

2 Jo 2 4 Jo 

1-3 5...(2n- 


The elliptic inlegral of the third kind is 


2*4«6...2n Jo 


U{n,k,<t>) 


r — 

io (I 4- n 


do 


sin^ sin^ 0 


and this, too, can be expressed as a stories by expanding the radical. 

Any integral of the form 

J{aBmz + b cos x + dx 

or of the form 

jR(x,\^x^ + hx^ + cx^ + dx + e) dx, E{x,y) = rational function, 

is expressible in terms of the elliptic integrals ^ together with elementary 
functions. For this reason elliptic integrals have great practical impor¬ 
tance and have been extensively tabulated. 

* See, for example, P, Franklin, ^'Methods of Advanced Calculus,” chap. 7, McGraw- 
Hill Book Company, Inc,, New York, 1944. 




sue. Ill 


POWEK SEHIBS ANI> TAYLOH S FOKMUBA 


149 


PROBLEMS 


- dl. 


1. Expand the following integrak as power series: 

rdn{e)dt, f~c 

Jo Jo t Jo Jo t 

2 . Express / as a powcT series in x. Hint By Wailis' formula, 

Jo 

I sin” i 
Jo 


‘ t dt 


(ri — l)(i/ — ,3) 


2 or 1 


n(n — 2) ... 2 or 1 


where a =* 2 if n is odd and tv « r if n is even. 
S. Exiin'ss the mromphie gamma function 


Jo 


dt 


as a series in powers of x. For wliat values of x and p is your expansion valid? 
4 . Tlie beta function is defined by 


B{p 


S - /'Vhi - 

Jo 


xy-^ dx. 


Kxpn‘S8 this a& a sf'ries by using the bmonual theorem for fl — and integrating 
term tiy li'rm h'ur what vshu‘s of p and q is the re.sulting series nl>sohitely (*onverg(‘nt? 
(S(‘e Tiieoiem Mb, See 5. Although the range of int(‘giMlion includes the value x «» 1, 
v; Inch in an < nd point (»f t he eonv('rgene<" interval, the integiation is easily jiibtified. Thus, 

one iiiight eonsuler / and let x -♦ 1 througli values less than 1. The desired result 
Jo 

then follows fiom Abel's theorian, St'e. 8 ) 


11. Approximation by Means of Taylor^s Formula. If a function fix) 
has a convergent Taylor seri(‘s, tlnui the partial sums of that series can be 
used to approximate the function. In this way, calculations of great in¬ 
trinsic cornplexit.v an* roductnl to calculatiors invohing [)olynornials. The 
method is especially important because Taylor's tormula not only gives a 
polynomial approximation but gives a nutans of estimating the error. 
Thus, the remainder /?„ in (9-2) and (9-^^) is precisely tlie difference be¬ 
tween f(x) and the nth partial sum of its Taylor series. 

'To illustrate the use of Taylor's senes for numerical computation, let 
us find sin 10° williin =bl0~^ The value 10° *= 7r/18 radian is closer to 
zero than to any other value of x for which sin ,r and its derivatives are 
easily found, and hence the expansion is taken about the point x = 0. To 
estimate the number of terms retpiired, (9-3) gives 


|i4l 


/<"’(«) „ 

-X < — 

n! n! 



(0.175)” 


n! 


(IM) 


when a — Of and when we set x = w/lS = 0.175 and recall that sin x to¬ 
gether with its derivatives is lass than 1 in magnitude. The successive 
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bounds for Rn as given by (11-1) may be computed recursively; indeed, the 
nth value is obtained by applying a factor (0.175/ri>) to the preceding one. 
For n » 1 the bound (ll-l) is 0.175 and the next few are as follows: 


Value of n. 

2 

3 

4 

5 

6 

Bound for j iin 1. 

1.5 X 10-* 

8.8 X lO""* 

3.9 X 10-* 

1.4 X 10-« 

4.0 X 10~* 


From a list such as this the n sufficient for any prescribed accuracy can be 
determined at once. In particular, an accuracy of dbl0“^ is found if we 
take n « 6. Thus 


sm 


IT /ttX^I /irX®! 

“ 18 ~ (is) 3! (li) ^ ^ 


where | jRel 4.0 X 10“®; more explicitly, 0 > Re > —4.0 X 10“®, since 
=K — sin $ < 0. Inasmuch as the next term of the series is zero, 
the first six terms are the same as the first seven f£rms. Hence the error is 
also equal to Rj, where 

0>R7> ~1.0 X 10“®. (ll~2) 

An improvement of accuracy such as this is to be expected whenever the 
series is terminated just before one or more terms with zero coefficients. 

In modern computing practice an automatic computing machine is so programmed 
that it keeps track of the remainder, which can of ton be estimated recursively as in 
this example. The machine is then instructed to take as many terms as are needed to 
make the remainder less than some preassigned amount. This process was illustrat/ed 
in the foregoing calculation, where the value n *» 6 was chosen, not at random, but by 
consideration of the desired accuracy. 

The reader may have noticed that the series for sin (ir/18) is an alternating series with 
terms decreasing in magnitude. Hence, the estimate for the error (11-2) could have 
been found by Theorem II of Sec, 6. Taylor’s formula, however, has the merit of apply¬ 
ing to general power series, whether alternating or not. 

Many important approximations are obtained by using the first few 
terms of the Taylor-series expansion instead of the function itself. For ex¬ 
ample, the fonnula 

k ^ 2/"[l + 


for curvature of the curve whose equation is i/ ~ f{x) yields 


k *= 



3 „ 13 5 

iy') + -( 2 /')^ 

2 ! 2 2 ^^^ 



when we use the binomial theorem. The first-term approximation k ^ is 
sufficient for most applications. 

As another example, in railroad surveying it is frequently useful to know 
the difference between the length of a circular arc and the length of the 
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corresponding chord. Lei r be the radius of curvature of the arc AB 
(Fig. 6), and let a be the angle intercepted by the arc. Then, if s is the 
length of the arc ^4.^ and c is the length of the 
chord ABf s ^ ra and c = 2r sin 34a. Since 


sin X « X-!— cos 

3! 51 

where 0 < f < x, the error in using only the first 
two terms of the expansion is certainly los.s than 
x'’’/5I. Then, 


c = 2r sin ■ 


with an error less than 


2r 


/a \ 

\2 ~ 8^)/ 



2r 


/ \ ra'*' 

\32 120/ 1,920* 

Therefore, s — c — a^r/24 with an error that is less than ra^''/l,920. 

Example 1. For the nonelementary integral / du obtain a polynomial approxima- 

Jo 


tiori valid within -i-0 00001 when 0 < x < 
According to (9-6) and (9-7), 

x""*’ 


1+X + I-, + 0<{<I. 

{n — 1)! n! 


If we set X » u^, this bc*(*omes 


« 1 4- +•••+.— rr; + o < ^ < u^, 

2! (h — 1)’ n! 

and integrating from 0 to x yields 


/> 

Jo 


du X A -1-{-1- 

^ 3 5 2!^ ^ (2n - l)(n 


Jo nT 


du. 


To estimate the integral on the right we note that 
since ^ < w* and u < x. Hence 


It follows that if we write 


f du < du 

Jo Jo 

{■X 00 

I du ^ 7^— 

Jo n«i (2 h — 




2n -}- 1 


l)(n ~ 1)! 


then the error iZsn after the term x®” ' satisfies 

0<Rin< 


n!(2n + 1) 
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For AH approximation valid within dt0*00001 when 0 ^ a? £ Ji, w© choose n large 
enough to make 

^ < 0 . 00002 . 

nf(2n “h 1) 

Since < 1.3, the above condition is satisfied when 

nl(2n + 1)2®’*+^ > 65,000. 


By trial we find that n » 4 suffices. This choice of n yields an approximation which 
is too small by 0.00002 at most. If 0.00001 is added to the approximation, we get 





(U-3) 


within dbO.OOOOl when 0 < x < H- 

Example 2. Obtain a polynomial approximation for / * dx valid near x * 0. 

h 

Keeping terms as far as x® and no terms beyond we have 


Hence 


1 + x + -x‘ + 0-x^ +• 


Jo 


dx 


^+2 +6 +■ 


where the terms omitted involve x® or higher powers. 

This calculation of the series for illustrates a principle which is often useful. 
Let f(y) *» S6nj/” and y =« ^Onx” be power series with nonzero radii of convergence. 
If y » 0 when x =» 0, then the power series for f{y) as a function of x also has a nonzero 
radius of convergence. This series may be found by suhsiilvixng the series for y into the 
series for f(y) and collecting termsJ By uniqueness, the series so obtained is the Taylor 
series. 


PROBLEMS 


1. It is desired to approximate a function /(x) by a polynomial p(x), 


p{x) a<, + oix H-h a«x". 


in such a way that at the origin p(x) has the same value and the same first n derivatives 
as/(x). (a) How should the coefficients be determined? Hint: Oq «» p(0) **/(O), ai «» 

p'(0) » /'(O), 2a2 * p'XO) « /"(O),_ W If the coefficients are determined as m (a), 

what relation does p(x) have to the Maclaurin scries for f(x)? 

2. For the following functions obtain a polynomial approximation valid near x *« 0 
by finding the first three nonzero terms of the Maclaurin series: 


tan X, *, sec x, 


sin X 

iTc*' e* -1' 


^ For proof, see K. Knopp, ‘‘Theory and Application of Infinite Series,’' p. 180, Blackie 
A Son, Ltd., Glasgow, 1928. 
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8. (a) By means of series compute a; — 10 sin (x/10) to three significant figures when 
X ^ 1.000. (6) Attempt the same calculation by using a table of sin x (note that a; is in 
radians). How many significant figures for sin x are needed? 

4. If j/ »« 10 (tan X — x)/x^, (a) use series to evaluate y near x «* 0. In particular, 
what is the limit of y bsx 0? (6) Plot y versus x for 0 < x < 0.2. (c) Discuss the 
construction of such a graph by use of a table of tan x^ without series. 

6. By use of series compute to three places of decimals: 

(a) ^ « 27183e^ ^ (6) cos_l0^ « cos (tt/IS); 

(r) frin 33° « sin (30° + 3°); (d) ^35 « 2(1 -f 1^2)^. 


6. Evaluate by series the first three integrals to three places of decimals and the last 
to two places: 

sin X dx log (1 z) 


f sin (x*) dx, [ 
Jo Jo 


f 

Jo 


dz, 


VT — X* Jo 
j (2 — cos x )*"^ dx ^ j ^ 1+2 sin^ dx. 


7, Determine the magnitude of a if the error in the approximation sin a ^ a is not 
to exceed 1 per cent. Htrii (a — sin a)/a *= 0.01 and sin a *= a — («V30 + (aV50 - 


8. Discuss the percentage error in the approximation (11-3) as x 0. How would 
the p<.^rcentage error behave if the term 0.0(X)01 had not been added? (Tlus shows that 
it may be better not to alter the Taylor series even when such alteration reduces the 
absolute error.) 

Ah in Example 1 of the text, obtain a polynomial approximation for the Fresnel 


sine integral 



dt which is valid within ±0.00001 for 0 < x < 3^. 


POWER SERIES AND DIFFERENTIAL EQUATIONS 

12. First-order Equations. One of the most important uses of power 
series is in the solution of differential equations. P'or example, to solve the 
equation y' == y assume that 

1/ = ao + aix + H-h UnX^ H-. 

Then, according to Theorem II, Sec. 8, 

y' = ai + 2(12^ + 3a3X^ + d-h (rt + d-. 

Since ?/' = t/, Theorem III of Sec. 8 shows that the series for y* and the 
series for y must have the same coefficients. Thus, 

Ul == up, 2^2 = (l\y 3^3 = • • '} “H i)^n + l ^ . . .. 

Starting with ao = c, a constant, we solve for oi, a 2 , ... in succession to 
obtain 

ai“C, 02 = -. «3 = ^’ •••, 
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wd henoe 

cx® ex® ex” 

,- 0 +.* + - + - + .■•+- + ■••. 

This discussion is tentative only, since at first we had no assurance that 
the equation y* » y possesses a solution expressible as a power series. 
However, the ratio test shows that the series obtained converges for all x. 
Hence by Theorem II of Sec. 8 the term-by-term differentiation is justified 
and the equation //' = y has actually been solved. 

As another illustration, we obtain a power-series solution for the differ¬ 
ential equation 

(1 + x)y' ^ py, p « const, (12-1) 

such that y * 1 when x « 0. If p » Sanx”, then ^ 

y'm. ai + 2a2a; + 3 aax^ H-h (n -f l)an4.ix'' H- 

xy* * aix + 2 a 23 ^ H-h ruxnx'^ H- 

P2/ « poo + paix 4* pa23^ H-h po>nX^ + “ * 

and the substitution in (12-1) yields 

ai + ( 2 a 2 + ai)x + (803 4- 4- * — h [(^ 4- l)on-hi 4* nan]x^ -f . .. 

» pao 4- paiX + pa2X^ 4 -h pOnx’' 4 -. 

Equating coeflScients of like powers of x gives the set of equations 

Oi ~ poo 

202 4“ ax = pax 
803 4- 2 o2 = pa 2 


(n 4- l)an 4 -i 4“ nOn = pan 
which must be solved for the os. 

Since y ^ 1 at x = 0, we must have Oo == 1. Then we get, in succession, 
p(p - 1) p(p - l)(p - 2) 

Oi = p, Oo ~-:-» 03 =-» ... 

2! 31 

80 that the solution is 


y « 1 + px + 


P(P - 1 ) ^2 


2! 


x='+- 


p(p - l)(p - 2) ... (p - n + 1) 

H-;-x" + • 

n! 


( 12 - 2 ) 


Throughout Sect. 12 to 14 2) means A brief review of this tigma notation is given 
in Sec. 2. 




POWER SERIES AND DIFrERENTIAL EQUATIONS 


155 


SBC. 13] 

Hence, if the differential equation (12-1) has a series solution, that solu¬ 
tion must be (12-2). However, the ratio test shows that the series (12-2) 
converges for |a;| <1. Hence for \x\ <1 the term-by-term differentia¬ 
tion was justified, and this shows without further discussion that (12-2) is 
a solution for |x| <1. 

Equation (12-1) can be solved by elementary methods as follows. Sep¬ 
arating variables, we get 

dy dx 

— = p -' 

y l + X 

so that log 2/ = p log (1 + x) ~ log (1 + x)P wlmn we recall that p * 1 at 
X = 0. Hence 

2/ =» (1 + x)^ 

Comparing this solution with that found fonnerly ^ gives 


(1 + X)" = 1 + £ 


(p ~ n + 1) 


which the reader will recognize is the binomial theorem. Since no assump¬ 
tion was mxide about p the result ts valid for all p provided \z\ <1. 


PROBLEMS 

1. Obtain power-sericB Boiations of the following differential equations, whiih satisfy 
y « 1 at X ““O' 

y' y' « y -f X, ?/' -f y * 1. 

2. Obtain the first three terms of a series solution y »= Sanx” for the problem y* 2:^y^ 

y « 1 when x *» 0. From these three t-erras compute the curvature k « j/"fl -j- 

at j =“ 0. Is your value for curvature exact or only approximate? 

3. By considering the equation j/' ** (1 — obtain a series expansion for sin”^ x. 

In particular, show that 

6 * 2 ’*^2 3 - 2 * '^ 2-4 5 ' 2 ®'^ 2 ' 4-6 7 - 2 ^'^*’‘' 

13* Second-order Equations. Legendre Functions. To illustrate the 
use of series for solving second-order differential equations consider the 


equation y" - xy'+ y ^ 0. (13-1) 

If p == SUnX"*, then p' « SnonX^’'^ and p" = Sn(n - l)a„x^'~^ and, hence, 
y*' rss 2a2 3 •2a3X (n + 2)(n + l)un-f 2 ^’' +... 

—xp * —“ fljx — • • * nOfiX — • • * 

p == ao + aix -I— • + Onx” 

* By Chap, 1, Sec, 17, the problem lias only one solution. 
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According to (13-1) the sum of these three power series is sero, and there¬ 
fore, by Theorem III of Sec. 8, the coefficient of in this sum is zero for 
each n; 

202 + Oo « 0, coefficient of 

3 *2 o3 — Oi -h Oi = 0, coefficient of x^, 


(n + 2)(n + l)a„-|-2 — ruin + “ 0, coefficient of 

Hence, 

(n + l)(n + 2) ” 

This recursion formula gives, in succession, 


( 13 - 2 ) 


1 

1 

= --00 = 


1 

1 



3 

3 

= - Oa = 

5-6 



<hn 


13*5 ... ( 2 n - 3 ) 

^n)I 




n > 1. 


Similarly, aa = 0 • ai = 0, as = 0, ay = 0 , and so on. Hence the solution is 

^2 ^ 4 ^- 3 , 1 - 3.5 „ \ , 

w=«ao(l-- X - x^ - * I + aix 

\ 2 ! 4 ! 6 ! 8 ! / 


with the two arbitrary constants Oq and ai. There should be two constants 
in the general solution because the equation is of the second order (Chap. 1, 
Sec. 21). 

The ratio of two successive nonzero terms of the infinite series satisfies 


OZn**" 


2n ~ 1 

(2n -h l)(2n -f 2) 


1:^*1 



(13-3) 


as is seen by using (13-2) with 2n written in place of n. Since the limit of (13-3) is zero, 
the series converges for all x. Hence the term-by-term differentiation is justified, and 
we really do get a solution, 

Upon choosing oo » 0, ai « 1 we see that a particular solution is 


Vi X 


(13-4) 
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and the choice oo « 1, Oj » 0 shows that another particular solution is 


y2 


1 

2 ! 




l*3-5 

8! 


(13-5) 


According to Chap, 1, Sec. 21, two solutions yi and 2/2 such as this are 
linearly independent if the equation Ciyi + C 22/2 ^ 0 can hold for constants 
Cl and C 2 only when Ci — C 2 == 0. The solutions (13-4) and (13-5) are in¬ 
dependent, since one is an odd function and the other is even.^ Hence the 
expression aoyi + aii /2 is the general solution of (13-1) (Chap. 1, Sec, 21). 
The independence of i/i and 2/2 can also be deduced from the following 
theorem: 

Theorem I. Let pi = SUnx” and 2/2 ~ he power series with nonr 

zero radii of convergence^ and suppose that Pi ^ 0 , If pi and 7/2 linearly 
dependentj then there is a constant c such that bn == can for all values of n. 

From ciSono:" + 0, it follows that 


fjan “h cgbn M 0, n «9 0, 1, 2, .... (13-6) 

If 02 =" 0, then (13-6) gives an « 0 for aU n, contrary to hypothesis. Hence C 2 0, 
and (13-6) gives « (~'ri/r 2 )an. This is the required result. 


Obviously not every differential equation has solutions that can be 
represented by tiie power series,^ The following theorem due to Fuchs, 
which we state without proof,* gives sufficient conditions for the existence 
of power-series solutions of second-order linear equations. 

Theorem JL Let y" + fi(x)y' +/2(x)?/ - 0 have coefficients fi{x) and 
/2(x) which can he expanded in convergent power series for \x\ < r, Tli^n 
every solution y can he expanded in a convergent power series for \x\ < r. 

The solution series converges at least for |a;| < r but may converge in a larger inter¬ 
val. For example, consider the equation 

(2 - x)i/" + (x - W - V « 0. (13-7) 


Writing (13-7) in the form 



0 


(so that the coefficient of y" is 1 as in Theorem II) we get 


Mx) 


x — 1 1 a; — 1 

2 -^ x “ 2 1 ~ 1^' 


1 11 

‘ A function/(x) is odd if /(—x) « —/(x), etren if/(—x) w/(x). 

* Thus, xy' w 1 has a solution y »• log x which cannot be expanded in Maclaurin's 
series. 

* A simple proof is given in H. T. H. Piaggio, **An Elementary Treatise on Differential 
Equations and Their Applications/' 2d ed., George Boll & Sons, Ltd,, London, l^)28. 
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Snce/i(x) and/aC^p) have power-series expansions for |ar| <2, Theorem 11 asserts that 
the solution y also has a power<«erieB expansion valid for |x| <2. Actually, the solu¬ 
tion of (13-7) is 

y - ciS* -f cad — x) - Ci2 — -f cad - x), 
ni 

which converges for all x. 


As another example we shall find the complete solution of Legendre^6 


eqmtion 


(1 - x^)y'' — 2xy' + p(p + l)y « 0 , 


(13-8) 


where p is a constant. 

By Theorem II this equation has a power-series solution valid at least 
for (x I < 1. Assuming y = Sonx”, we get 

y" == + 2)(n + l)x’‘ 

—— 2 — ann(n — l)x'‘ 


—2xy’ = 2 — 2/ianx” 

P(P + 1)2/ = '^dnPiv + l)x". 

By (13-8) the sum of these series is zero. Considering the coefficient of 
x" yields 


«n+ 2 (n + 2)(n + 1 ) » an[n(w + 1) - pip + 1)] (13-9) 

after slight simplification. For all n > 0 we have 

(p - n)(p + n + 1 ) 

a «+2 =“ -a„ ■ ■ .— - 7 -; — (13-10) 

(n + l)(n + 2) 

after factoring the bracket in (13-9) and dividing by (n + 2)(n + 1). The 
coefficients for even n are determined from Uq, and those for odd n from Ui. 
Computing the coefficients successively, we get the final result 


ao 


1 - 


p(p +1) 

21 




P(P - 2)(p + l)(p -F 3) 
4! 




-f- 


Oi j^a 


(p-l)(p + 2) (p - l)(p - 3)(p + 2)(p + i) 


3! 


■x^ + 


5! 




Theorem II guarantees that the general solution can be obtained in this fashion, and 
indeed, by Theorem I, the coefficients of oo and ai in the foregoing expressions are in¬ 
dependent. Equation (13-9) shows that the series converge for 1*1 <1 when we apply 
the ratio test to the ratio of successive nonzero terms. When p is a positive even integer, 
however, the expression involving oo reduces to a polynomial, hence converges for all x. 
If Oo is so chosen that the polynomial has the value unity when * » 1, we get the sequence 


Po{x) 

Ptix) 

Piix) 


1, 

3 2 1 
2 2 

7 5 , ^5-3 , 

— X* ^ 2 

4*2 4*2 


+ 


3‘2 
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and so on. Similarly, when p is a positive odd integer, the coefficient of at terminates 
and we get a sequence of which the first three terms are 

Pi(a:) « X, 

Pi(x) « -x* - -z, 


PM 


2 

9-7 


2 


-- 3-5 _ 2 — X® 

4-2 4-2 


6-3 

'4*2' 


These polynomials are known as Legendre polyrwmials; they arise in several branches of 
a]>ph(‘d mathematics. 


PROBLEMS 


1 . Solve by means of f>ower SKiries: 

0, y" -f y ^ y'^ - x?/ « 1. 

2. For Eq. (13-7), show that y = Xanx”^ leads t-o 

2(7i -f- 2)(/; 4* l)an4 2 — (n 4 l)“nn fi 4' (« l)a7i = 0. 


(a) By taking no =* Oi =» 1, find a solution satisfying y = y' « 1 at x « 0. {b) By an¬ 
other choice of oo and at find a solulion satisfying y « —1, y' = 1 at x » 0. 

8. Solve by means of power senes if y(0) « y'(0): 

(x^ - 3j- + 2)y" + (x^ -2x- \),/ + (x - Z)y = 0. 

4. Solve y" — (x — 2)y =« 0 by assuming y * Xan(x — 2)”. Also obtain the first 
three terms of the solution in the form y ~ 2)orix” 

6. It can be shown ^ that 


(I - 2x;i -f ^ 4- P2ix)h^ + • • • 4- Pn(r)h^ 4- " •, 

V(‘nfv this equality through the terms m h’’ Hmi Mvpaml fl — (2/jr — by the 

binomial theorem, and collect powers of h The function (1 — 2xh is called 

the genexaiinq jvnehon of tiie srHjucnce Pn(x). 

6. Verify Rodrigues forimda^ 

Pn(x) - - D” 

tor n “ 0, 1, 2, 3. 


2”nl dx" 


14. Generalized Power Series. BesseBs Equation. An important dif- 
ftTeiitial equation was encountered by the German astronomer and mathe¬ 
matician F. W. Bessel in a stiidy^ of planetary motion. The so-called Bessel 
functions which arise from the solution of this equation are indispensable 
in the study of vibration of chains, propagation of electric currents in 
cylindrical conductors, heat flow in cylinders, vibration of circular mem¬ 
branes, and many other problems of applied mathematics. 

Bessel’s equation is 

xY' + xy' + ( x * - pYj = 0 , ( 14 - 1 ) 

1 E. J. Whittaker and G. N. Watson, “Modern Analysis,” pp. 302-303. Cambridge 
University Press, London, 1952. 

^Ibid, 
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whare p is a constant. Theorem II of Sec. 13 does not apply to this equa¬ 
tion, since 

1 

/iW =. Mx) « 1 ^ 

X X* 

and these functions cannot be expanded in power series near j =» 0. For 
this reason we do not expect a power-series solution y = XanX^. It was 
shown by Fuchs, however, that a wide class of equations, including (14-1), 
have solutions of the form ^ 

00 

y = x^Sanx" = E a„x"+'>, Oq 5-^ 0 (14-2) 

where p is constant. The theorem of Fuchs reads as follows: 

Theorem I. Let xfi(x) and x^f 2 (x) have power-series expansions valid for 
IX1 < r. Then the equaiion 

y" + fi{x)y' + / 2 (x)y = 0 

has a solution of form (14-2), also valid for |xl < r. 

Since BesseFs equation gives 

x/i(x) = 1, x^/aCx) = X® - p®, 

Theorem I asserts that the series (14-2), when found, will be valid for all x. 
To obtain this series, note that 

I* i = z a„x"+'-+=' = Z o„_2x“+^ (14-3) 

naaO naaO n«»2 

as is seen by writing out in full. The limits (2,oo) on the latter summation 
may be changed to (0,oo) if we agree to define 

On == 0 for all negative n. (14-4) 

Hence 

xV' = 2an(n + p)(n + p - l)x"'^^ 
xy^ = Xonin 4“ 
x^y = 

•^p^y » 2 — p^anX^^^, 


According to Eq. (14-1), which we wish to solve, the sum of the four terms 
on the left of the above equations is ssero. Hence, the same is true for the 
series on the right. Equating to zero the coefficient of in the sum of 
these series gives 

cin{n -f- p)(n + p — 1) + o.n{n + p) + an^% — p^an « 0 


' The novelty is that we allow p to be any number, whereas if (14-2) is an ordinary 
power series, p must be an integer. Since p may be increased at will, the assumption 
00 ^ 0 involves nb loss of generality. 
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or, after simpMcation, 

OnKn + p)^ — + an-2 = 0. (14-5) 

Equation (14-5) is valid for all n. For negative n Eq. (14-5) holds auto¬ 
matically by virtue of (14-4). The first nontrivial case of (14-5) is called 
the indicial equation; it is obtained in the present example by putting n «= 0 
and takes the form 

OoCp^ — p^) + 0 «= 0, indicial equation. (14-6) 

This shows that p = p or p == — p, since ao ^ 0. I'lie other values of Un 
are determined from (14-5) in the form 


On 


1 

(n + p)^ 


an—2* 


(14-7) 


The choice n = 1 gives' ai = 0, and hence Un = 0 for all odd n. Also 


02 = 




Oo 


(2 4- p)" 
«2 


tto 


(4 + p? - p* [(4 + p? - f]\{2 + pf - p*] 

and so on. In this way it is easily verified that the Serbs correspotr fing to 
p = p is 


y = dox^ 


1 


+ 


2(2p + 2) 2-4(2p -f 2)(2p + 4) J 

and that the series for p == — p is the same, with —p in place of p. 

When p is a nonnegative integer, the expression may be simplified by use 
of factorials, as follows. We take a factor 2 from each term of the denom¬ 
inator and place it with the x in the numerator, obtaining 


y = oqX^ 1 


(x/2)^ 


+ 


ix/2)* 


l(p+l) l-2(p+ l)(p + 2) 

If the denominators are now multiplied by pi, there results 
r 1 (x/2f (x/2-f 

y [p! l.(p+l)!'^ 2 r(p + 2 )! 

and since x" = 2^(x/2)'’, this yields y = aop\2’’Jp{x), where 


(14-S) 




(-l)”(x/2)®”+'’ 


n-o wKp + n)! 

’ Provided the denominator in (14-7) does not vanish. See Prob. 7. 


(14-9) 
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The function Jp{x) is called the Bessel function of order p. The graphs of 
Jo(x) and Ji(x) are shown in Fig. 7. 

Now, the differential equation (14-1) is meaningful even when p is not a 
positive integer, and the series solution (before introduction of factorials) 
is also well defined for general p. It is natural to inquire if we can define p! 
in such a way that (14-9) is meaningful and satisfies (14-1) for p unre¬ 
stricted. A glance at (14-8) shows that such an extension may not be pos¬ 
sible when p is a negative integer, but there appears to be no difficulty 
otherwise. 



To obtain an appropriate definition of p! for arbitral}" p, we introduce 
the so-called gamma f unction Y{p) defined by 

r(p + 1} = / re~^dt, p > 0. (14-10) 

Jo 

This function was discoven'd by the celebrated Swiss mathematician L 
Euler. Because of its connection with />’ the function r(p + 1) is often 
called the factorial function and is written p^ or 11 fp). We shall use the 
notation p! as soon as we have estn-blislied that (14-10) gives the appro¬ 
priate value when p is an integer. For comparison with other notations, 

p! ^ n(p) ^ r(p + 1). 

If P > 1. integration by parts * m (14-10) gives 

fOO lOO roo 

I P I dt. 

Jo lo Jo 

Since the integrated term drops out, the foregoing relation simplifies to 

r(p-bl) «pr(p) (14-11) 

when we use (14-10). 

Writing (14-11) in the form 

r(p) « p~»r(p -b 1) (14-12) 

' Since the improper integrals (Sec. 3) are convergent, the process is justified by writ¬ 
ing b in place of oo, carrying out the partial integration, and then letting 6 -♦ oo. Actu¬ 
ally (14-10) holds for p > — 1, and the partial integration is valid for p > 0. 
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enables us to define r(p) for negative values * of p. Thus, if any number p between 
0 and 1 is used on the left side of (14-12) then the right side gives the value of r(p), 
for when p > 0, T(p 4* 1) is determined by (14-10). If the recursion formula (14-12) is 
used again, the values for —1 < p < 0 can be found from those for 0 < p < 1, That 
is, p 4* 1 in (14-12) ranges over the interval (0,1) if p ranges over the interval ( — 1,0). 
Similarly, when we know r(p) for —I < p < 0, we can find r(p) for —2 < p < — 1, 
and so on. 

Inasmuch as (14-10) gives 

r(l) * rt^e~^di « I » 1, (14-13) 

^0 lo 

the method fails for p » 0. Thus 

lim r(p) « lim p“^r(p 4 1) 4» or — 

0 p 0 

according as p 0 through positive values or through negative values. Similar be¬ 
havior is found for all negative integers, and hence the 
graph of r(p) has the appearance shown in Fig. 8. 

However, by use of (14-10) and (14-11) it i.s easily 
verified that T{p) never vanishes, and hence, if vfa 
agree that l/r(p) 0 for p a negative integer, it w'lll 
follow that the function l/r(p) is well behaved for 
every value of p without exception * 

Equations (14-11) and (14-13) give r(2) »*» r{l) = 1 
and, in succession, 

r(3) - 2r(2) « 24 
r(4) « 3r(3) - 3>2 1 


r(n 4 1)“ nVin) « n{n - 1) ... 3-2-1. 

Hence, the definition 

p? = r(p 4 1) (14-14) 

furnishes the desired generalization; it gives a meaning to p’ for all p except p « — 1, 
—2, —3, ..., and it gives the famihar value w^hen p is a positive integer. The properties 

p! « p(p - 1)1 or P ^ “ (p ~ ~ " ij1 (14-16) 

are ensured by (14-11). The former fails when p is zero or a negative integer, but the 
latter holds for all p, without exception 

The result of this discussion is that 1/p! is well defined everywhere, and 
p! is well defined except for p = —1, —2, —3, ..at which values 1/pl 
vanishes. Moreover, we have the fundamental formula (14-15). When a 
series containing factorials is obtained by solving a differential equation, it 
is almost always this relation (14-15) that makes the series a valid solution, 

* Equation (14-10) does not serve this purpose, because the behavior of at ( 0 

makes the integral diverge when p < *~1. 

* The relation of r(p) and l/r(p) is quite analogous to the relation of esc p and sin p, 
respectively. 





INFINtM SERIES 


164 


[chap. 2 


and hence the extended definition of p! may be used without hesitation. 
In particular, Jp{x) and J^p{x) are solutions of (14-1) no matter what 
value p may have. 

For most values of p the functions Jp{x) and J^p{x) are independent, by 
Theorem I, Sec. 13, and the general solution of Bessel’s equation is 


y =* CiJp(x) + C 2 J_p(a:), Cl and C 2 const. 

If p « 0, however, then the two roots of the indicial equation are both 
p = 0, so that we obtain only the single function Jq{x). Another excep¬ 
tional case arises when p = ±1, dt2,- Although the series (14-8) is 

meaningless when p is a negative integer, the series (14-9) is well defined 
and satisfies 


J^mix) - i-lTJmix), m - 0, 1, 2, 3, . . .. 
To see this, we observe that 

n\{n-m)\ !(t^ - m)! 


(14-16) 


(14-17) 


since the factor l/(n — m)! ia zero when n < m. If the sums (14-9) and 
(14-17) are written in full, then (14-16) follows at once.’ Because of (14-16) 
the functions J_m(^*) and Jm{x) are dependent, so that we obtain only one 
solution rather than two. 

This failure of the method to provide both solutions is not a serious 
shortcoming, sin(‘e the second solution can always be found from the first 
by the method of Chap. I, Sec. 29, Carrying out the calculation in the 
general case yields the following theorem: ^ 

Theorem TT. ]l7icn ihe rooh pi and p 2 of the indicml equation are distinct 
and do not differ by an integer^ the vulhod of Theorem 1 yields two linearly 
independent solutions. If the roots do differ by an integer a second solution 
can he found by assuming that 

(X) 

2/2 = log j +- s (14-18) 

n-^0 


where yi{x) is the solution given by Theorem I for the root p ~ pi. 


By setting yiix) » 
equation for p ** 0 ia 

Kq{x) « 


J^\(x), for example, one fan show that a second solution of Bessel’s 


* Jo(x) logx - X) 

A--.1 






^ It is suggested that the reader verify this statement by actually writing the sums 
in full 

* It was shown by Frobenius that the second solution can also be obtained by differen¬ 
tiating the first solution with respect to the exjionent p; of. Piaggio’s book cited ia 
Sec* 13. 
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This function is called the Bessel function of the zeroth order and second kind. Thus, 
the general solution of 

+ jr' 4- w 0 

is 2 / *» ci/o(x) -f Other functions Kmix) of the second kind are obtained simi¬ 

larly. By considering linear combinations of Jm{x) and Km{x) we get the modified Beend 
functiom of the first and second kinds, denoted in the literature by Im(x), Ym(x), Nm(x), 
and Hm(x), 


PROBLEMS 

1. Show that Jq(x) « —/i(x) and also that 

— xVnW « xVn^lix), * -X~'Vn^\{x). 

Deduce that 

Jn^-xU) - 2j;(r), 

2u 

Jn-^lix) +Jn-i i(x) “ — Jnix). 

3 * 

2. The confluent hypergeometric equation is 

xy" -f py' « xy' 4- gy^ 

where p and g are constant. 

{a) According to Theorem I, what range of validity do you expect for a solution of the 
form Sanx”~*“^? 

(b) Assuming that y ~ verify that xy" « Xun^iin -f p 4- l)(n 4- and 

find similar expreasion.s for py\ xy\ and qy 

(c) By considering the coefficient of deduce 

On fi(« 4- p 4- l)(w -f P -f- p) =* 0„{n 4- p 4- g) 

for all values of n. 

3. In Prob. 2, (a) show that the roots of the indicial equation are p «« 0 and p «* 1 — p. 
Hint- The first nontrivial case arises when n =* -* 1. 

(6) When p =« 0, show that 

_ n 4- g _ 

On 4-1 «n ^ 

if p is not zero or a negative integer. Thus get the solution 

fi + 4 - 5iLl:’If! + ^! + 1 

L p(p + 1)2' pip + a)(p + 2)3' +”‘J' 

(c) Similarly, obtain the solution corresiKinding to p =« 1 — p when p is not a positive 
integer. 

4 . For the hypergeo metric equalton of Gauss, 

x(l - x)y" 4- Ic ~ (a 4- 4” ~ aby « 0, 

obtain one solution in the form 

* r(a -f n) Vib 4- n) 

^ n^o r(l + n) Vie 4- n) 


when o and b are not negative integers. 
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xy" + (1 ~ 2p)y' -h a^/ « 0 


obtain a recursion formula for the coefficients of a series solution, and show that one 
solution is 1 / « x^Jp(x). 

6* As in Prob. 6, show that y » x^Jp(\x) satisfies 

+ (4X^x^ -- 4p^ -f l)y «* 0. 

7* (a) Given that r(l4) *=' ‘x/’r, obtain the formulas 


Jh(^) 






{h) What is the general solution of Bi'ssel's equation with p *» (Tliis shows that 
Theorem I may yield the general solution even if p 2 — pi is an integer.) 

8 . The generating function of the sequence Jni^c) is 

= f; J„{x)h’‘. 

n»~oo 

Verify that the coefficient of in the expansion of the exponential is, in fact, ./o(-^). 
Hint: By the series for the exponential is 

Pick out the term independent of h in the binomial expansion of [A ~ (1/'^)]'* wh(»n n 
is even, and note that there is no such term when n is odd. 


SERIES WITH COMPLEX TERMS 

16. Complex Numbers. The eipiation + 1 = 0 cannot he solved by 
means of real numbers because the rule of signs does not allow the s(|uar(* 
of a real number to be negative. But if one adjoins a symbol i to the real 
numbers^ which satisfies the equation 

(15-1) 

by definition, then one can construct the so-called complex numbers a + hi 
The latter satisfy the algebraic laws obeyed by real numbers, and they in¬ 
clude the real numbers as a special case. Moreover, complex numbers en¬ 
able us to solve not only the equation x* + 1 = 0 but cverij polynomial 
equation. 

Since we want to keep the familiar laws of algebra, it is easy to see how 
addition and multiplication of complex numbers ought to be defined. In¬ 
deed,^ if a, b, and ^ in a + ih are to be treated like any other numbers of 
elementary algebra, then 

(a + ih) + (c 4“ i^ ~ (o + c) ~f" i(b + d), (15-2) 

* See also Chap. 7, which contains a discussion of complex numbers and functions from 
a somewhat different point of view. 
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This equation is now taken as the dehnition of addition. In the same way 
we are led to define multiplication by 

(a + ib)(c -f- id) = (ac — bd) + i(bc + ad), (15-3) 

since elementary algebra would give the product 

ac + ihc + i(id + iH)d — or + i^bd -f- i(bc + ad), 

and (15-1) asserts that i^ = —1. Finally, we agree that a + ib ^ a + bi. 

It is easy to verify that these definitions (15>2) and (15-3) do pres(‘r ve the familiar rules 
of algebra (including those rules that were not considered in fr ariung the definitions) For 
example, complex numbers Zk satisfy 

2-1 -f- 22 =« 22 + 2l, {Zi + 22) d ~ + (?2 Zj), 

ZiZ2 * 22^1, (2i22)2'j ~ 21 ( 2223 ), 

21(22 -f 23 ) = 2i22 4- ZiZz. 

Also there is a zero, and there is a unit: 

2 4 (0 4- Or) *“ z, z{\ 4- Or) «= 2 for all z. 

Moreover, the complex number a + Oi is found to be equivalent in every respect, except 
notation, tx) the real number a Hence in this sense the complex numbers contain the 
reals as a special cas€\ and we have a right to consider that 

a4“0i^a. (15-1) 

The convention (15-4) also agrees with our purpose of keeping the rules of algebra intact. 
Using (15-4) we write 0 and 1 for the zero and unit element of our algebra. Subtraction 
iH defined by considering the eejuation (a + tU) -f 2 = 0; it will be found that 

-(a *-1 ht) « (-a) 4- {—h)i =* (~l)(u th). 

Although wo made no attempt to preserve the curua^lhition law, it is nevertheless valid; 
that is, 

ZiZo = 0 only if zi =* 0 oi 22 = 0. 

And finally, the possibility of division is sugge.sted b 3 " 

a 4- ih (a 4 /5)(r — id) 
c id (c 4“ id)(c ~ id) 


ac 4" bd he ~ ad 
“ + ’ > 1^2 ■ 

Now, the latU‘r expression can be shown bj'^ (15-3) to satisfy the equation 

(c 4- id)z « a 4 - 

Hence, the result of this heuristic calculation is, in fact, the quotient. The process breaks 
down if c 4* td « 0, but only then. 

The general tenor of this discussion is that the algebra of complex num¬ 
bers agrees with the algebra of real numbers and we need not hesitate to 
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apply the familiar rules to the new symbols. There is one new feature, 
however. When we iBay that a + is a symbol for a complex number, we 
assume, naturally, that different symbols represent different numbers. In 
other words, if 

a + ib a' + ib' for a, a', 6, V real, then a ®= a' and b = 6'. (15-5) 

This important relation may be taken as the definition of equality. Un¬ 
like the algebraic properties described hitherto, (15-5) is true for complex 
numbers only; it does not hold if i is replaced by a real number. 

The following alternative analysis of (15-5) shows the role of the equation «• —1 
and also shows why a, a\ h, 5' must be assumed real. If a 4 - ** a' + then 

a — a' ^ i(h' — h). 

Squaring gives (a — o')* « —(5 — ft')* because »» —1, and hence 

(a ~ a')^ 4 (6 - 6 ')* « 0 . 

Since the square of a real number is positive unless the number is zero, the latter equa¬ 
tion implies a — a' « 0 and 6 — 5' » 0 . 

Complex numbers z z -h iy may be represented graphically in the so- 
called z plane by introducing two perpendicular axes, one for x and one for 
y (Fig. 9). The z axis is called the real axis, and z is the real part of x + iy; 
the iy axis is the imaginary axis, and iy is the irnaginar>' part of x -h iy^ 
The absolute value of z is the distance from the representative point to the 
origin; it is denoted by | 2 :|, as in the case of real numbers. Evidently, the 
points satisfying 1^1 = r lie on a circle of radius r centered at the origin. 
The interior of this circle consists of the points j ^1 < r. When z — z + ly, 
then 

|2| = V^ + 2/2. (16-6) 

A short calculation based on (15-6) gives 

I 

IzjZal = |2i| Iz2l, — =-j—r iiz2?^0, 

Z 2 \Z2\ 

80 that the absolute value of a product is the product of the absolute values, 
and similarly for quotients. 

Since real and imaginaiy parts are added separately, computation of 
zi 4 Z 2 can be effected as shown ^ in Fig. 9. Inasmuch as the sum of two 
sides of any triangle is greater than the third, the figure gives the important 
inequality 

\Zl + Z2\ <\Zl\ + \^2\■ (15-7) 

^ One may Hank of z a; 4 li!/ as a vector with components x and y. The method of 
adding vectors by adding components agrees with the definition (15-2), and hence the 
construction of Fig. 9 is simply the parallelogram rule familiar from mechanics. 
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A similar result may be obtained from this one for any number of complex 
numbers. 

If Sn =* Un + tVn Rud s ^ u -f- tv, we define lim Sn ^ s to mean that 
simultaneously 

lim Un « u, lim Vn = v. (15-8) 



This shows that the theorj^ of limits for complex numbers can be based on 
the corresponding theory for real numbers. 


PROBLEMS 


1 . Show Uiat \ziZ2\ « l^il \z2\> 

2. Show that hm Sn « s if, and only if, hm |«n — sj =0. 

3. 8k(‘t(*h the set of points z in the or>mplcx plane described by (a) 1 2 1 »* I, (b) 1 2 j <2, 
(<’) 1^1 > 1, (d) I 2 2| <1. Hint: |2 — a| is the distance from the representative 
point for z to that for a. 

16* Complex Series. Convergence of infinite series of complex numbers 
is ilefined by considering the limit of partial sums, just as for real series. 
By (15-8) the complex series converges if, and only if, the two real series 
obtained by considering real and imaginary^ p«‘irts are both convergent. 
In other words, 

'^{Pn + kn) p + iq 

if, and only if, Spr» — P and - ^/- Because of this correspondence of 
real and complex series, most of the results prcjsented hitherto in this chap¬ 
ter apply with little change to the complex case, and the proof also involves 
nothing new. 

As an illustration let us show that the general term of a convergent senes approaches 
zero. If Ufi is complex, we have 

n n~l 

“ S a* — ttA;, 

and taking the limit as n « yields Um an « 0 , exactly as in the proof for real series. 
Alternatively, one may use the result for real series Namely, if a* * pit -f kkf then the 
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convergence of Xak implies the convergence of Spit and of Hence, by the theorem for 
real series, lim pn "® 0 and lim q„ 0. Consequently, lim On « 0, 

As a second illustration, wo show that an absolutely convergent sertes ts convergent. 
With Oit »• PA + iqk, we have 

|p*i “■ < V pi + (ii = |o*|. 


Hence, if S i Oit 1 converges, then ^ 1 pa 1 converges by the comparison teat for real serievs, 
and then Sp^ converges, because we know that for real senes absolute convergence im¬ 
plies convergence. Similarly, i\/A: convfTges, and hence 2 J(pa + uy) converges. 

As a third example, the leader may oblam the analogue of Theorem I, See S, foi com¬ 
plex series. That is, if conveiges for z = zo, tlien the .seiies convergc'S ab.soluii'ly 

for all such that | 2 | < (so I and uniformly for all 2 such that Izi < | 2 j| < |2n| Jt will 
be found that the proof is the same, word for word, as the proof in the ease of real sent's 
The symbol for absolute value, however, has the moaning assigned in (15-6). 


For many series 2u„(2), the set of points z at which (ho series cr)nvergi\s 
gives a complicated region in the z plane. It is a remarkalile fact that for 
a power scries the region of convergence is always a cirtde centered at the 
origin. The circle is called the circle of convergence, and the radius of the 
circle is the radms of convergence. We agree to take the radius as zero if 
the series converges for 2 : - 0 only and as infinity if the series convergi*^* 
for all z. At points on the boundary of the circle the series may eilluu* 
cemverge or diverge, just as in the ease of the int(‘rval of convergence for 
real series. 

For proof that the region is a circle, let be a power series whii h 

converges for some value z — zo ^ 0 but diverges for some other value 
z ^ Zi. As we have already noted, the fact that the series converges for 
2 ; ~ 20 makes the scries converge throughout the circle \z\ < j.^o| On 
the other hand, the .series obviously doe.s not converge throughout, any 
circle containing the point Zi (see Fig 10). We let C be flic largesf curie || 
« ranch that the series converges at ever>’ interior point of C. The radius 
r of C is at least equal to l^ol hut does not exceed \zi\. 

To show that C is the circle of convergence, all we have to do is (establish 
that the series diverges at every point exterior to C. Let Z2 be an exterior 
point, so that \z 2 \ > r. If the series converges at 2 : 2 , then it would have to 
converge throughout the circle 1^1 < 12 ^ 2 !. But this contradicts the fact 
that C is the largest circle throughout which the series converges, and hen(*e 
the proof is complete.^ 

To illustrate the concept of circle of convergence consider the series 

——- - 1 - + x* -f- (-1)V" H- 

1 + x^ 

* The fact that a largest circle exists is quite clear from the geometry. An analyti(‘iil 
proof may be given, if desired, by constructing circles with successively larger radii 
and using the funitidamental principle (Sec. 1). 
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which converges for \x\ <1 but diverges at a: ~ dbl. If 1/(1 -f x^) is 
regarded as a function of the real variable x, there appears to be no reason 
why the series should diverge when |a:| > 1, for 1/(1 + x^) has derivatives 
of all orders at every value of x. But when we regard x as a complex vari¬ 
able, the divergence is explained by the fact that the denominator 1 -f 
vanishes at x = zfct. Clearly, if the circle of convergence cannot contain 
the points then the radius of convergence cannot exceed 1. 



Fig 10 


PROBLEMS 

00 

1. Verify that the series 21 z^/rv converges absoluMy at every boundary point of 

1 

oo 

its circle of convergence wlicrcas 23 converges at no boundary point of its circle 

1 

of convergence 

2. If/(z) - f-i"’ and /(O) => 0, 

(а) How does J{z) behave when z «= x and x 0 through real values? 

(б) How does /(z) behave when z ^ ty and 1 / —► 0 through real values? 

(c) Could this function have a power-series expansion valid m some circle contain¬ 
ing the origin? Explain. 

17. Applications. By means of power series the functions sin x, c*, log x, 
tan"^^ X, Jp(x), and so on, may be extended to a complex variable z, I'or 
example, since the series 

x^ x^ 

-~l 0 g(l—x) + + -1-1- 

2 3 n 
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converges for [a^j < 1, we know without further discussion that 

^ + + -1-•— 

2 3 n 


converges for jzj <1, The latter series is the definition of — log (1 — z). 
Similarly e*, sin z, and cos z are defined by 





sin z « 


(_l)n^2n^2 

l2i^r+iyr^ 


cos Z = 


X 


(-l)V” 


Many familiar formulas can be extended at once to the complex-variable 
case. To establish that 


sin^ z + cos^ z = 1, 


(17-1) 


for example^ we reason as follows. It is known that (17-1) holds when z is a 
real variable x, and hence 



2 


2-^ — i - 

+ 

V ^ 

(2?i 1) 1 - 

L" (2»t)! J 


(17-2) 


when z is real. The left side of (17-2) is a po^\er series, as we see by imagin¬ 
ing that the terms are collected. Since the power series is zero for an inter¬ 
val of X values, every coefficient must be zero (Sec. 8, Tlieorem III). Hence 
the power series is also zero when z is replaced by a complex variable z, and 
this establishes (17-1). 


The same method may be used to prove formulas involving two variables; for example, 
from 

for xi and X 2 real, 

it follows wiihoiU detailed calculation that 

c*i+*2 «, g*ie;*2 for asj and zt complex. 

The systematic development of this idea leads to a branch of analysis known as the 
theory of analytic continuation. 


Upon letting z = iz in the series for c*, we get 
(tx)” ^ 


, X" zr x" 

5-a: - =» 1 —-f- — —-f-' 

nl 2! 4! G! 


,(x 


when we write out the sum in full, noting that 


{2 « 


■u 




1 , 


X X 
- 1 - 

3! 6! 


It « 


7! 


+ * 
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The jBeriee representations for cos x and sin z now give Euhr^sfmmula, 

= cos X + i sin x, (17-5) 

which expresses the exponential function in terms of the trigonometric 
functions. On the other hand (17-3) also leads to 

(JOS X =-, sin X --;-» (17-4) 


as the reader can verify. These equations are constantly used in the study 
of periodic phenomena, for example, in network anabasis and synthesis, in 
physical optics, and in electromagnetic theory. I'he calculations are ordi¬ 
narily carried out by complex exponentials, and then the appropriate 
trigonometric form is obtained by taking real or imaginary parts [cf. (17-3)1. 


Example 1. Obtain the trigonometric identity 


cos w -f- cos 2u H-h cos nit = 


sin {n 4 " 

2 sin 

for It 0 , ±2ir, :±:4ir, ..,, by considering the exponential sum 
« « 4- 4. ... 4- 

The series s is a geometric series with ratio r « c*“, and hencse 


1 


(17-5) 


(17-6) 


by ( 1 -X 8 ). If the numerator and denominator in (17-6) are multiplied by we get 

S vsi . I II, - -- SE3 . . . . . . . . . 

2t sin Hw 

upon using (17-4). By (17-3) this yields another expression for the exponential sum s, 

cos (n 4' 4" 1 sm (n 4- H)w — cos }'iu — i sin 

* 2t sin ^iu 

which leads to (17-5) when wo equate real parts. 

flTT 

Example 2. Show that / c*** dz *« 0 if A’ is a nonzero integer, and deduce 


r 

/: 


• cos mx dx 


cos nx sin mx dx =« 0 


J sm nx Bin nixax ^ 


if w 5 ^ n, 
if m n, 


if m 7 ^ n, 
if m » n, 


(17-7) 


whenever m and n are positive integers. 
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If A; is a nomero integer, (17-3) gives 

/.air ^2v 


J r* /** 

' dx » I (cos kx 4* i sin kx) dx » 0, (17-8) 

0 ./o 


which is the first rissult. By (17-4), 


i c 


•A 


4 / cos nz cos mx dx ^ j (c*"^ 4- (e*”** 4- <ia; 

I j-g»(in4n)* ^ ^•(w—«)« ^i(n~m)x i(OT4~n)z| 

Jo 


Each term is of the form with k an integer, and hence we get zero unless m « 7i. 
If m ^ n, the two middle terms give 2, so that Uie integral is 4ir. The other relations 
(17-7) are established similarly. 


PROBLEMS 


1. By using the series definition for e*, show that (d/dx)e^^ « when c is a comph^x 
constant. (Since we have not defined the derivative with respect to a complex variable, 
assume that x is real.) 

2. Sum the series sin x 4- sin 2x 4-f- sin nx. 


. Evaluate J cos bz dz and J c"* sin bx dx by considering dx and equating 

e“*c»**(a - tb) 


real and imaginary parts. Hint: . , 

.2T « + 


+ 

4 . Evaluate / (2 cos x)^ dx by using the formula 
Jo 


2 cos X =* e** -b 

together with (17-8). 

6. Sliow’ that every complex number z may be written in the form z » rc*^, where 
f > 0 and 0 < 5 < 2ir. Hint: If z x ■+■ zj/, introduce polar coordinates (r,^), so that 
X *» r cos and y « r sin 

6* (a) For 0 < r < 1 obtain the expansion 


--- (cos nfi -b I sin nff) 

1 ~ rc*^ 


by letting z »« in the series for 1/(1 — z). 

(b) Separate 1/(1 — re*^) into real and imaginary parts, by noting that 


l—rc*^ 1 1-r cos $ ir sin 0 

1 — 1 — re'^ 1 — 2r cos (? 4* 


(c) From (a) and (6) deduce that 


1 — r 008 0 

1 — 2r cos 0 4* ^ 

r sin 0 

1 — 2r COB 6 -b r* 


»* 53 r** cos n^, 

n>M^ 

00 

» 5^ r** sin n0. 


0 < r < 1, 
0 < r < 1. 


[The first series of (c) is an example of a Fourier cosine series^ and the second is a Fourier 
stne series. The study of such series by real-variable methods forms the topic of the next 
eight sections.] 
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FOCnUER SERIES 

18, The Euler-Fourier Fonnulas. Trigonometric series of the form 

00 

/(x) J^ao + 23 (®n COS 7 ix + sin nx) 


nwfl 


(18-1) 


are required in the treatment of many physical problems, for example, in 
the theory of sound, heat conduction, electromagnetic waves, electric cir¬ 
cuits, and mechanical vibrations. An important advantage of tlie series 
(18-1) is that it can represent discontinuous functions, whereas a Taylor 
series represents only functions that have rlerivauves of all orders. 

We take the point of view that/(x) in (18-1) is known on ( — tt, tt) and 
that the coefficients an and are to he found. In order to determine ao, 
we integrate (18-1) term by term from —tt to tt. Since 


/: 


cos nx dx 

r 

the calculation yields 


■ f sin nx dx == 0 for n = 1, 2, .. 

J —ir 

j f(x) dx = aoTT. 


(18-2) 


The coefficient o„ is determined similarly. Thus, if we multiply (18-1) by 
cos nx, there results 

/(x) cos nx = cos nx -h an cos^ nx d-, (18-3) 

where the terms not written involve products of the form sin mx cos nx or 
of the form cos mx cos nx with m 9 ^ n. It is easily verified ^ that for inte¬ 
gral values of m and n, 

[ sin mx cos nx dx == 0, in general, 

f COS mx cos nx dx = 0, when m, 9 ^ dbn, 

IT 

and hence integration of (18-3) yields 

f f{x) cos nx dx == On f cos^ nx dx =*=* o„7r. 


Therefore, 


1 yir 

an ^ - I fix) cos nx dx, 

fr •'—IT 


(18-4) 


By (18-2), this result is also valid ’ for n •= 0. 

* See Example 2 of Sec. 17. 

»That is the reason for writing the constant term as ^oo rather than oo. 
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Similarly, multiplying (18-1) by sin nz and integrating yield 


hn 


1 fr 

- / f(x) sin nx dx, 

T 


(18-4a) 


The formulas (18-4) are called the Euler-Fourier formulas^ and the series 
(18-1) which results when «« and 6« are determined by the Euler-Fourier 
formulas is called the Fourier series of f{x). More specifically, a Fourier 
series is a trigonometric series in which the coefficients are given, for some 
absolutely integrable function ^fix), by (18-4). 


The distinction between a convergent trigonometric senes and a Fourier series is im¬ 
portant in the modern development of the subject and is a genuine distinction. For 
instance, it is known that the trigonometric series 


00 

z 


sin nx 
log (1 + n) 


is convergent for every value of x without exception, and yet this series is not a Fourier 
series. In other words, there is no absolutely integrable function f(x) such that 



cos nx dx 


0 , 



ain nx dx 


TT 

Fog (1 -j- n) 


On the other hand a 8f*ries may be a Fourier series for some function/(x) and yot diverges. 
Although such functions are not considered in this book, they ofbm arise in practice, 
for example, in the theory of Brownian motion, in problems of filtering and noise, or in 
analysing tlie ground return to a radar system Even when divergent, the Fourier series 
represents the main features of/(x), and for this reason Fourier series are an indispensable 
aid in problems of the sort just mentioned. 

Treatises devoted to Fourier series commonly replace the sign of equaKty in (18-1) 
by or some similar symbol to indicate that t he sentts on the right is the Fourier 

series of the function on the left. We shall continue to use the equality sign because the 
series obtained in tliis book do, in fact, converge to the function from which the coeffi¬ 
cients were derived. 


To illustrate the calculation of a Fourier series, let f(x) « x. By Eqs. 

(18^) ^ 

On ~ f X COS fix dx = 0, 


1 r 
in--i 

nr •'—< 


2 2 

X sin nxdx — -cos nr ^ - ( — 1)”"^^ 

n n 


so that, upon substituting in (18-1), 


— 2 ^sin X 


sin 2x sin 3x 

—r- + — 


(18-5) 


In Sec. 24 it is shown that the series (18-5) does converge to x for — ir < 
X < r. To discuss the convergence outside this interval, we introduce the 
notion of periodicity. A function /(x) is said to be periodic if /(x + p) 
‘ This means timi l/(x) i, as well as/(a:), is integrable. 
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« f{x) for all values of x, where p is a nonzero constant. Any number p 
with this property is a period of f{x ); for instance, sin x has the periods 2 t, 
~'2ir, 4tr, .... 


Now, each term of the series (18-5) has period 2tr, and hence the sum 
also has period 2 w. The graph of the sum therefore has the appearance 
shown in Fig. 11. Evidently, the sum is equal to x only on the interval 
—IT < X < TT, and not on the whole interval —oo < j < oo. 

It remains to describe what happens at the points x = ix, dbSTr, ..., 
where the sum of the series exhibits an abrupt jump from —tt to -f-ir. 



Upon setting x = dzT, d::37r, ... in (18-5), we see that every term ia zero. 
Hence the sum is zero, and this fact is indicated in the figure by placing a 
dot at the points in quest ion. 

The term Un cos nx -}- hn sin nx in (18-1) is sometimes called tlio nth Itarmonfc (from 
analogy with the theory of musical instruments). The first four harmonics of the series 
(18“5) are 

2 sin X, — sin 2 j, fa sin 3x, ~~ H sin 4x. 

These and the next two harmonics are plotted as the numbered curves in Fig. 12. The 
sum of the first four harmonics is 

2 / » 2 sin X — sin 2x 4* sin 3x — sin 4x. 

Since Uiis is a partial sum of tiie Fourier series, it may ho expected to approximate the 
function x. The closeness of the approximation ia indicated by the upper curves in Pig. 
12, which show this partial sura of four terms together with the sums of six and ten 
t<erms As the number of terms increases, the approximating curves approach y «» x for 
each fixed xon -“ir <x <7r but not for x ~ ±7r. 

The foregoing example illustrates certain features which are character¬ 
istic of Fourier series in general and which will now be discussed from a 
general standpoint. Each term of the series (18-1) has period 2ir, and hence 
iff(x) is to be represented by the sum, then/(x) must also have period 2ir. 
Whenever w^e consider a series such as (18-1), we shall suppose that f(x) is 
given for — ^r < X < IT and that outside this interval /(x) is determined by 
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the periodicity condition 

J{x + 2t) - /(X). 

Of course, any interv^al a < x < a + 2ir would do equally well. 

The terra simple discontimdty ' is used to describe the situation that 
arises when the function f(x) suffers a finite jump at a point x = xq (see 
Fig. 13). Analytically, this means that the two limiting values of/(x), as 
X approaches xq from the right-hand and the left-hand sides, exist but are 
unequal; that is, 

lim/(xo + e) 9^ limfixQ — c), c > 0. 

In order to economize on space, these right-hand and left-hand limits are 
written as /(xo+) and /(xq —), respectively, so that the foregoing inequal¬ 
ity can be written as 

f(Xo+) 5^/(Xo-). 

A function/(x) is said to be hounded if the inequality 

\m\ < M 

^ For an example of a discontinuity which is not simple, consider sin (1/x) near x «■ 0. 
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Fig. 13 


Fig. 14 


holds for some constant M and for all x under coasideration. For example, 
sin I is bounded, but the function 

1 

/(a:) = for i ^ 0, 

X 

m - 0 , 

is not, even though the latter is well defined for every value of x. It can 
be shown that if a bounded function has only a finite number of maxima 
and minima and only a finite number of discontinuities, then all its dis¬ 
continuities are simple. That is, /(x+) and fix — ) exist at every lvalue of x. 

The functions illustrated in Figs. 11 and 13 satisfy these conditions in ever/ finite 
interval. On the other hand, the function Hin(l/jr) has infinitely many maxima near 
X * 0, and as we have noted, the discontinuity at j « 0 is not simple. The function 
defined by 

f(x) = a;^sin -> x 9 ^ 0, 

X 

/(O) * 0 

also has infinitely many maxima near a; *= 0, although it is continuous and differentiable 
for every value of x. The behavior of tnese two functions is illustrated graphically in 
Figs. 14 and 15. 

i,y 

/ 
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With these preliminaries, we can state the following theorem, which 
establishes the convergence of Fourier series for a very large class of func* 
tions: 

Dibichlet’s Theorem. For —7r<x<^ suppose f(x) is well defined^ is 
hounded^ has only a finite number of maxima and> minima^ and has only a 
finite number of discontinuities. Let f(x) be defined for other values of x by 
the periodicity condition f(x + 27r) » /(ar). Then the Fourier series for f(x) 
converges to 

H[/(x+) +f(x-)] 

at every value of x [and hence it converges to f(x) at points where f{x) is con-- 
tinuoiLs]. 

The conditions imposed on f{x) are called Dirichlet conditions y after the 
mathematician Dirichlet who discovered the theorem. In Sec. 24 we estab¬ 
lish the conclusion under slightly more restrictive conditions which are 
sufficient, however, for almost all applications. 

Example 1. Find the Fourier series of the periodic function defined by 
fix) * 0, if -vr < X < 0, 

fix) » X, if 0 < X < X. 

By (18-4) we have 

0 dx -h J X dx^ *“ X, 

1 

On ^ - I ^ COS nx dx *» 0, n > 1, 

X Jo 

1 1 

6„ ~ / X sin nx dx » ~ (1 — cos nx). 

X Jo w 

The factor (1 — cos nx) assumes the following values as n increases: 


n = 

1 

2 

3 

4 

1 

5 

•• 

(1 — cos nx) 

2 

0 

2 

0 

2 



Determining bn by use of this table, we obtain the required Fourier stiries 


+ 2 


( sin 

“T 


X sin 3x 

~ H- 


sin 5x \ 


The graph of /(x) consists of the x axis from — x to 0 and of the line AB from 0 to x 
(Fig. 16). There is a simple discontinuity at x « 0, at which point the series reduce! 
to x/2. Since 

z„fJ!t±±M±h 

2 ” 2 


this value agrees with Dirichlet^s theorem. Similar behavior is found at x «*■ ±t, 
5b2x, .... 
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The figure shows the first four partial sunin, \\ho8e equations are 

sin 3x\ 


y + 2 sin X, 




sin z 4* ' 


^sir 


, . sm 3x sin 5x 
+ 2 ( sm z -f —4- — 
d o 


)■ 


For most functions it is only the infinite series that reduces to 4-/(3c4)I at 

points of discontinuity. In the present example, however, this condition is sati^^iod by 
the partial sums, as the reader can verify. That is, the graph of each partial sum contains 
the points (0 ,t/ 2), (drjr,x/2), .... 

Example 2. Find the Fourier series for the periodic function /(z) defined by 
fix) « -T, if --r < z < 0, 

fix) * X, if 0 < z < “JT. 


The integral (18-4) may be expressed as an integration from — ir to 0, followed by 
integration from 0 to ir. If the appropriat,e formula for fix) is used in these two inter¬ 
vals, w'e get 


(In 


TT COS nxdx j X cos nx dz j 


-uc 

1 / cos nx 1 \ 1 / 


cosnx — 1> 


The integration assumes that n 0; if n « 0, we get oq » — t/ 2, as the reader can 
verify. Similarly, 


, i. /l[ — ^ coe nx-cos nx^ ~ (1 — 2 cos nx) 

X \n n n / n 




1S2 

Therefore 
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fix) 


2 cos 3z 2 cos 5a; 


^ . sin 2 j: ,3 sin 3ar 
+ 3 sin X-— + —-- 


sin 4a; , 3 sin So; 

A * S' 


When X « 0, the series reduces to 

T 2 / 

4 IT V]2 


1 1 

+ p + 5»+- 


which must coincide with (see Fig. 17) 

/( 0 +)+/( 0 -) 

-- -- oc —. • 

2 2 



Thus 

Hence 


; /I 1 1 \ 

V,1 + 31 + 52+- ; 


111 jr^ 

P + P + 5 i+--- “-g- 


This example suggests the use of Fourier series in evaluating sums of series of constanf 


PROBLEMS 

1. Evaluate j cos mx cos tix cb for integral m and n by use of the identity 

2 cos A cos B - cos (A -f B) -f cos {A ~ B). 

2. Find the Fourier-series expansion for /(x), if 


/(X) - 

for —r < X < ~» 

2 

/(x) “ 0, 

. r 

for ~ < X < IT. 

/(X) - -X, 

for —ir < X < 0, 

fCx) - 0, 

for 0 < X < ir, 




6. In Probs. 2 to 5, sketch the graph of the function to which the Fourier series con¬ 
verges in the range — 4ir < x < 4 t. 

19, Even and Odd Functions. Ftir many functions the Fourier sine or 
cosine coefficients can he determirif^d by inspection, and this possibility is 
now to be investigated. A function f{x) is said to be even if 

fi-x) ^fix)y (19-1) 

and the function f{x) is odd if 

K-x) ^ ~*/(x). ri9.2) 

For example, x^ and cos x are even, whereas x and sin x are odd. The 
graph of an even function is symmetric about the y axis, as shown in Fig. 
18, and the graph of an odd function is skew symmetric (Fig. 19). By 


inspection of the figores it is evident that 



f f{i) dr = 2 f f(x) dx 

J —a JQ 

if fix) is even, 

(19-3) 

f fix) dx — 0 if/(x)is 

J —a 

odd, 

(19-4) 



Fig. Xa 
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Fig. 19 


since the integrals represent the signed areas under the curves.* For 
example, 

f sin nxdx Of 
J^a 

since sin nx is an odd function. 

Products of even and odd functions obey the rules 

(even) (even) = even, (even)((xld) — odd, (odd) (odd) « even, 

which correspond to the familiar rules 

(+!)(+!) == +1, (+l)(-l) « -1, (-~1)(-1) « +L 

For proof, let F{x) = /(x)^(x), where/(a;) and g{x) are even. Then 

F{-x) f{-x)g(-x) f{x)g{x) « F{x), 

which shows that the product f{x)g(x) is even. The other two relations 
are verified similarly. As an example, the product cos nx. sin mx is odd 
because cos nx is even and sin mx is odd. Hence, (19-4) gives 

f cos mx sin nx dx ^ 0 
J —a 

without detailed calculation. 

The application of these results is facilitated by the following theorem: 
Theorem. If f{x) defined in the interval —ir < x < ir zs eveUy the Fourier 
eeries has cosine terms only and the coefficients are given hy 

2 fv 

o>n ^ i /(^) cos nx dXy bn » 0. (19-5) 

T •'0 

* An analyiac proof of (IM) and (19-4) may be based on (19-1) and (19-2). 
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The function to which this series converges is illustrated in Fig. 22, and the sum of the 
series (1^7) is presented graphically in Fig. 11. 



8 ince |x| = x for x > 0, the two aeries (19-7) and (19-8) converge to the 
same function x when 0 < x < ir. The first expression (19-7) is (‘ailed the 
Fourier sine series for x, and (19-8) is the Fourier cosine series. Any func¬ 
tion /(x) defined in (0,7r) which satisfies the Dirichlet conditions can be 
expanded in a sine series and in a cosine series on 0 < x < tt. To obtain a 
sine series, we extend/(x) over the interval —tt < x < 0 in such a way that 
the extended function is odd. Tliat is, we define 

F(x) = /(x) on 0 < X < T, 


F(x) = ~"/(|x|) on —TT < X < 0. 

The Fourier series for F(x) consists of sine terms only, since F(x) is odd. 
And the coefficients are given by (19-6) because F(x) = fix) on the interval 
0 < X < TT. Similarly, if it is desired to obtain a cosine series for f{x) on 
0 < X < TT, the ccxifficients are given by (19-5). 


Exatuple’ Obbiin a cosine series and also a sin(? senes for sm x. 

For the cosine series (19-5) gives bn ^ 0 and, after a simit calculation, 

2 . 2(1 + COH Vir) 

an ^ - sinxcosnxai = - x ;—» n 1. 

TT Jo Tr(] — rr) 

For n ** 1 the result of the integration is zero, and hence 


2 4 / cos 2x cos 4x cos 6x \ 

sm X »-I -r-- + -f- --) 

T TT \22 - 1 4^ - 1 h* - 1 / 


when 0 < X < ir. Since the sum of the series is an even function, it convergers to 1 sin x | 
rather than sinx when — t < x < 0. This shows, by periodicity, that the Berie.s con¬ 
verges to I sin XI for all values of x. 

To obtain a sine series, (19-6) gives an ** 0 and 


bn 


h 

Jo 


sin X sin nx dx 


C: 


for n > 2, 
for n » 1. 


Heiwse the Fourier sine aeries for sin x is sin x, just as one would expect. That this is 
not a coincidence is shown by a Uniqueness Theorem: If two trigonometric series of 
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the form (18-1) eoraierge to the eame turn for all values of x, then corresponding coefficients 
are equal} 


PROBLEMS 


1 . Classify the following functions as even, odd, or neither: 

1 4* X 

X sin X, x’ cos nx, x^, log-> e*, 

1 — X 

2 . Prove that any function can be represented as the sum of an even function and an 
odd function. Hint: f{x) * H{/(x) -[-/(-x)] + hif(^) 

3. For 0 < X < ir show that 


T 

4 


sin 3x sin 5x 

smx + -fj-.... 

o 5 


Hint' Take/(x) « 7 r /4 in (10-6) 

4. A function is defined by/(x) *= t forO < x < ir/2 and f(x) =“ 0 elsewhere in (—ir.x). 
Find the Fourier serii's, the Fouriei sme series, and tlie h'ouner cosine series. In each 
case sketch the graph of the sum of the series ior — 4ir < x < 4ir. 

6 . By taking/(x) x* m (lO-fi'k show that 


a n »=1 n 


for — TT < X < T, and deduce that 


12 


I® 


1 1 
+ p 


6 , Show that if /(x) * x for 0 < x < ir /2 and /(x) ~ tt — x for Tr/2 < x < tt, then 


/(/) 


( COM 

“T 


COM 2x (*O.S 6x COM 1 Ox 

+ ^r- + -^+- 


7. Show that for — ir < x < tt, 


sin TTor ^ V 2ar sin ira 
cos ax «- y 1) / ?T COS nx, 

Tra n~i ~ ir) 


when a IS not an integer. 


Deduce 
cot T« *= 


1 

T 




20. Extension of the Interval. The methods developed up to this point 
restrict the interval of expansion to (~7r,7r). In many problems it is de¬ 
sired to develop /(.r) in a Fourier series that will be valid over a wider inter¬ 
val. By letting the length of the interval increase indefinitely one may 
expect to get an expansion valid for all x, 

* This theorem is due chiefly to Riemann. It is much deeper than the analogous .state¬ 
ment for power seriea, and the prot^f would be quite out of place in the present book. 
Bee E. C. Titchmarsh, “The Theory of Functions,” pp. 427-432, Oxford University 
Press, London, 1950. 
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To obtain an expansion valid on the interval (--Ift), change the variable 
from X to fo/ir. If f(x) satisfies the Dirichlet conditions on (—1,?), then the 
function f{lz/ir) can be developed in a Fourier series in 


/I - j - h 2^ Oh cos nz + Z^bnSinnz 

\ir/ 2 n«-l n^l 

for —r < z < v. Since z ~ trx/ly the series (20-1) becomes 


Oo _ nirx 

, = — + 2^ On cos — + 2^ K sin 


nirx 


mrx 


2 ■ I n«l 

By applying (18-4) to the series (20-1)^ we see that 


(20-1) 


(20-2) 


On 


1 rr /lz\ 1 fl 

*” / /1 ) cos nz dz = - I fix) 

V \ir/ I ^ 


nirx 

COS - dx 

I 


and 


1 rr /lz\ ^ 

bn — - f [ — } sin nzdz — - fix) sin- dx. 

T Vtt/ I I 


As an illustration we develop /(x) in Fourier series in the interval (—2,2) if f{x) « 0 
for —2 < X < 0 and f(x) «* 1 for 0 < x < 2. Here 




Therefore, 

f 

If n is any integer, then 
ntr(x + 2?) 


riTi 

dx 4“ 


nvx , \ 

^ 1 

0 • cos -~~ 

/1 

• cos - dx\ 

BSt —- 

i 2 

/o 

2 / 

nv 

. nrrX 

dx 4- 


nrx \ 

1 

0 • sin 

/ 1 

. am 

-dx] 

av - j 

i 2 


2 / 

nv 

1 . 2/_, 

ITT 

1 

SttX 

1 . 

5xx 


5 2 


cos 




( nwx 

-1“ 2nir 

I 


) /nwx\ 
=* cos 


and similarly for sines. Hence, each term of the series (20-2) has period 
21, and therefore the sum also has period 21. For this reason the sum can¬ 
not represent an arbitrary function on ( — 00 , 00 ); it represents periodic func¬ 
tions only. 

Subject to the Dirichlet conditions, however, the function may be chosen 
arbitrarily on the interval (~Z,Z), and it is natural to inquire if a representa¬ 
tion for arbitrary functions on (— qo,qo) might be obtained by letting i 00 . 
We shall that such a representation is possible. The process leads to 
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the scHjalled Fourier integral theorem^ which has many practical applica¬ 
tions.* 

Assume that/(a:) satisfies the Dirichlet conditions in every interval ( —1,1) 
(no matter how large) and that the integral 

\f{x)\ dx 

• ■*“*'^10 

converges. As we have just seen, f{x) is given by (20-2), where * 

1 f* -/X 1 /■* 

Cln ^ - I f{i) COS — di, hn ^ 7 / f{t) BlXi — dt. 


Substituting these values of the coefficients into (20-2) gives 


If* 1 ^ r' 


COS ■ 


mr(t — x) 


di 


when we recall that 


mri nirx ^ nrt n^x nr{t — x) 

cos — cos-h sin — sin-— cos- 

I I I I I 


Since 



|/(j) I dx is assumed to be convergent, 


1 

21 



Ifi M 

<“• / /W dt<—^ 
2lJ-r ^ 21 


(20-3) 


which obviously tendvS to zero as Z is allowed to increase indefinitely. Also, 
if the interval ( — 1,1) is made large enough, the quantity tt/Z, which appears 
in the integrands of the sum, can be made a? small as desired. Therefore, 
the sum in (20-3) can be written as 


1 /■* 

" [Aa I f{t) cos Aa{t — x) di 

TT 


Aaj ^f(t) COS 2 Aa{l — x) dt 

+. 

+ Aa/^ /(/) COS n Aa{t — x) di 

+.], (20-4) 

where Aa *= w/l. 

‘ Some of theae applications are presented in Chap, 6. 

• W© use t as variable of integration to avoid confusion with th© x in (20-2). If f(t) 
is discontinuous at x «• xo, the left side of (20-2) means ^[/(xo-f) 4-/(^o-“)l. 
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This sum suggests the definition of the definite integral of the function 
F{a) j ^f{t) cos a{t x) dt 

in which the values of the function F{a) are calculated at the points 

V 2% Sir 

T T’ T' ““ 

NoW; for large values of I 

j ^J(t) cos a{i — x) dt 


differs little from 


/ f{t) cos a{t — x) 
J - 00 


dt 


and it appears plausible that as I increases indefinitely, the sum (20-4) will 
approach the limit 

I 

“ / dal f(t) cos a{t — x) dt. 

If such is the case, then (20-3) can be written as 

f{x) = ~ r da r f{t) cos a(t — x) dt. (20-5) 

^ Jq — 00 


The foregoing discussion is heuristic and cannot be regarded as a rigorous 
proof. However, the validity of formula (20-5) can be established rigor¬ 
ously ^ if the function f{x) satisfies the conditions enumuated above. The 
integral (20-5) bears the name of the Fourier inkgral. 

Formula (20-5) assumes a simpler form if f{x) is an even or an odd func¬ 
tion. Expanding the integrand of (20-5) gives 


1 

TT 



COS at cos ax dt + f f(t) 
J —00 


sin at sin ax dt 




for the right-hand member. If f(t) is odd, then /(/) cos at is an odd func¬ 
tion times an even function, hence is odd. Similarly, f(t) sin at is even 
when/(0 is odd. Upon applying (19-4) to the first integral in the foregoing 
expression and (19-3) to the second integral, we see that 




r r(X3 

_ da I Jit) 


sin at sin ax dt 


(20-6) 


* See H. S. Carslaw, ^‘Fourier’s Series and Integrals/’ pp. 283-294, The Macmillan 
Company, New York, 1921, or E. C, Titchmarsh, “The Theory of Functions/’ p. 433, 
Oxford University Press, London, 1950. 
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/(x) ^ (20-7) 


If /(x) is defined only in the interval (0,oo), then both (20-6) and (20-7) 
may be used, since /(x) may be thought to be defined in (~co,0) so as to 
make it either odd or even. This corresponds to the fact that a function 
given on (0,ir) may be expanded in either a sine series or a cosine scries. 

Since the Fourier scries converges to 3^1/(x-t-) + /(x — )] at points of dis¬ 
continuity, the Fouri(‘r integral docs also. In particular for an odd func¬ 
tion ^ the integral converges to zero at x = 0, and this fact is verified by 
setting X = 0 in (20-()). 

Example. By (20-7) obtain the formula 

ir/2, if 0 < a; < b 

ir/4, if J *= 1, 

0, ifx>l. 

We choose/(j) « 1 for 0 < a; < 1 and f{x) «= 0 for x > 1. Then 

., r , sin a 

I f{t) cos at (it ^ I COB at (it * - } a 9^ 0. 

Jo Jo a 

Substitution into (20-7) gives 

r* sin a r 

/-cos ax da — ~ f(x) 

Jq a 2 

after multiplying by 7r/2. Upon recalling the definition of /(j), we see that the desired 
result is obtained for 0 < x < 1 and for x > 1. The fact that the inU*gral is ir/4 when 
X « 1 follows from 

1 /(I -) 4-/(1+) 

2^ 2 


f sin 


sin a cos ax 


da 


PROBLEMS 

1. If fix) is an odd function on (—/,/), show that the Fourier series takes the form 

^ nirx 2 nwx , 

f{x) - 2^ b„ sin —» « ~ / fix) sin dx. 

n.-! I Wo I 

Similarly, if /(x) is even, then 

V , v-' aijrx 2 J,, V TiTcx 

fix) ** ““ + 2., aw cos —; an - fix) cos — dx. 

* n»l ' I Jo t 

^ It should be noted tliat every odd function, if defined at x * 0, satisfies /(O) « 0 
[although for an even function /(O) is arbitrary]. Hen(;e a function defiined for x * 0 
must sometimes be redefined at x « 0 before it can be made into an odd function. 
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ft* Ex;imnd tiie fua^tion defiaad by/(x) *» 1 oa (0,2) aad/(x) *• — I on ( —2,0). 

3. Expand/(j) « Ia;| in the interval (—1,1). 

4, Expand/(x) » cos ttx in the interval (—1,1). 

6. Find the expansion in the series of cosines, if 

f(x) «» 1, when 0 < x < »•, 

fix) 0, when ir < x < 2t, 

Hint: Regard/(x) as being an even function. 

0. Expand 

fix) >* — X, if 0 < X < 

fix) * X — if H < X < I. 


7. Show that the series 


I ^ 1 2nTX 

- 1. " «« 7 ' 

IT n«l n I 


represents — x when 0 < x < L 

8. Show, vith the aid of (20-6) and (20-7), that 

f a sin «x . -r „ 

-- da « - if /? > 0, 

. 2 

r cos ax , ^ 

Htnf; Take /(x) e’~^. 

9. An integral equaiion is an equation in wliich an unknown function appears under 
an integral sign. If Fit) is known and fix) is to l)e found, the integral equation of Fourier is 


\ fix) cos xt dx »* Fit). 
Jo 


(a) Using (20-7) show that a solution is given by 


/(x) « ~ / F(i:) cos xt dt. 
rr Jo 


ib) State a similar integral equation which can be solved by use of (20-6), and 
solve it. 

21. Complex Form of Fourier Series. The Fourier K(U’ies 


UQ _ 

fix) =«-h 2-f sin nx) 

2 71^1 


with an — - f f{t) cos nt dt^ K — - f f{t) si 

TT */—ir If J^-K 

can be written, with the aid of the Euler formula * 


sin nt dt 


« cos u + i sin u 


‘ See Sec. 17. 
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in an equivalent form, namely, 

fix) = Z 

where the coefficients c„ are defined by the equation 

” :r r 

2ir 


id3 

(21-3) 

(21-4) 


T})e index of summation n in (21-3) runs through the set of all positive and 
negative integral values including zero. 

I’he equivalence of (21-3) and (21-1) can be established in the following 
manner: Substituting from (21-2) in (21-4) gives, for n > 0, 


= — r /(Q(co8 nt — i sin nt) dt 
2ir 

1 i ^ 

= — j f(t) cos nt dt -/ f(i) sin ni dt 

2 t 27r •'-» 


2 


A similar calculation gives 


«n . K 
C_n = — 4- ^ 

2 2 


Oo 

¥ 


while Co 

Now (21-3) can be written in the form 

OC 00 

/(•r) = Co + E fnC'"* + E C-nC-'"*. 

natal “ ^ 


n w=l 


Making use of the expressions for the c„ just found gives 


fix) 


an — ibn 


e- + E 

^ na»l 


2 + ^ 

* n^l 

0,, " e”** + e“*"* 

2 + 2?- - 

^ n1 “ 


«n + 


- i 'E, bn 

n«=«.l 




By (21-2), 

-f =: 2 COS w and = 2f sin u 

and hence the latter series is identical with (21-1), 
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To illustrate the use of (21-4)» coasider the function f(x) *« «*** on (-“«r,ir). Here, 


27rCi 


-r, 


dt 


• f 

J —w 


"'•‘dt 


g{ct-<-in)i jir 




a — in \ .r 

Since (21-2) gives ** cos (±tiir) » ( — I)", we obtain 

(-!)« sinhira (-*!)« 


Cn « 

and hence by (21-3), 


2v a — in 
, smh tta 


~ T~r 2 

+ ir 


ir «..--oo ot"* + n 

The methods of the last section yield 


/„nn 

^ ZsTl ..2 


(21-5) 


m = £ with r„ = f f{l)e-'”^‘" dt, (21-G) 

riBc—Qo 2/ ^ 

for the expansion on an arbitrary interval ( — ?,/) Upon letting I oo^ we 
obtain the Fourier integral theorem in the fonn 


f{x) = lim f da r f{i)e 


,ta(x—0 


dt 


when/(x) satisfies the conditions postulated for (20-5). 
If 


g{u) = r 

-y/J tt 


dj, 


then (21-7) gives, after renaming some of the variables, 

1 fA 

/(j) = iirn —~r= / e***'^^(w) du. 

A » V 27r 

The transform T defiru^d by 


(21-7) 


( 21 - 8 ) 


(21-9) 


T(/)=-Ur e—/(T)rfz 

V27r •'-« 


is called the Fourier transform; it is one of the most powerful tools in the 
whole repertoire of modern analysis. Although T is related to the Laplaee 
transform L introduced in Appendix B, T is much easier to invert; that i.s, 
one can readily find/(x) by (21-9) when T(/) is knowm 


PROBLEMS 

1, Derive (21-6) from (21-3) and (21-4) 

2. (a) Show that 

UO *■ 2Co, Ufi C» Hh C_ny hn t(Cn — C—n) 

(and hence the real form (21-1) can be deduced from the complex form (21-8)]. 
(b) By applying your result (a) to (21-5) obtain the real Fourier series (21-1) for e®*. 
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8. By setting x »« 0 in (21-5) obtain the expansion 

^ ^ (~1)" 

... ■- as y . . .. 

a sinh va n—i* "b 


ADDITIONAL TOHCS IN FOURIER SERIES 

22. Orthogonal Functions. A sequence of functions Bn{x) is said to be 
orthogonal on the interval (a,5) if 


f e„{x)9„(x) , 

Jn 


for m 9 ^ Uy 
for m ^ n. 


r Bm{:r)dn{T) dx — f sin mx sin nx dx = I 

•'o i tt/: 


For example, the se(|uenee 

Bi{x) = sin Xy B 2 {x) = sin 2 x, .. Bn{x) == sin nXy 

is orthogonal on (0,7r) because 

r r^ . (0 for w 5 ^ n, 

Bm{x)6n{^) dx = / sin mx sin nx dx = { 

[ 7r/2 for m = n. 

The sequence 

1, sin X, cos X, sin 2x, cos 2x, ... (22-2) 

is orthogonal on (0,27r), though not on (0,7r). 

In the foregoing sections the fun(‘tions (22-2) were used to form Fourier 
series. Actually, one may form series analogous to Fourier series by means 
of any orthogonal set. The.se generalized Fourier series are an indispensa¬ 
ble aid in ele(‘tromagnetic theory, ai'oustics, heat flow, and many other 
branches of mathematical pliysics.^ 

The formula for Fourier coefficients is especially simple if the integral 
(22-1) has the value 1 for m — n. T'he functions 0„ix) are then said to be 
normahzedf and {^n(-r)i ivS called an orihonorrnal set. If 


Ja 


in (22-1), it is easily seen that the functions 

<i>n{x:) - (4n)~X(x) 


are orthononnal; in other words, 


rb 

I 4*m {^' 

Ja 


= 0 for m n, 

= 1 for w = n. 


For example, since 


r2v r2% r2ir ^ 

/ 1 fix = 27r, / sin^ nx dx = tt, / cos" nx dx =» t 

Jo Jo Jo 


^ See Chap. 6. 
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forn > 1, the orthoHormal set corresponding to the orthogonal set (22-2) is 
gjjj , ir“”^ sin n.r, cos nx,_ 


The product of two different functions in this set gives zero but the square 
of each function gives 1, when integrated from zero to 2ir, 

Let {(l>n{x )} be an orthonornial set of functions on (a,6), and suppose that 
another function f{x) is to be expanded in the form 

f(x) = Ci4>i(x) + C 2 <t> 2 (x) d - b Cn4>rt(x) H -. (22-4) 

To determine the coefficients Cn we multiply by <t>nix), getting 

f{r)<l>n{T) = Ci<l)iix)<t>„ix) H-h Cn[<#>„W]* H-. 

Here, the terms not written involve products <l>nix)<i>m{x) with m ^ n. If 
wo integrate from a b, these terms disappear, and hence 

f fix)(f>n(x) ds ^ f rn[4>nix)f dx == c„ (22-5) 

Ja Ja 


According to Tiieorera Til, Sec 7, the term-hy-term inlegiation is justiBed when the 
aeries is uniformly convergent and tlie functions are continuous Tli(‘ foregoing pro¬ 
cedure sIkws that if/(x) hab an expansion of the desned type, then the coofhnents r„ 
must be given by (22-5). In the following sect ion (22-5) is obtained m a different manner, 
which does not asbume uniform convergence 


The formula (22-5) is called the Euler-Fourier formula, the coefficients 
Cn are called the Fourier coefficients of /(.r) with resp(‘ci to {</>„(j:) j, and the 
resulting series (22-4) is called the Fourter senes of /(x) with re.sjx'ct to 
The reader can verify that the fon^gmng results applied to the 
sequence (22-2) >ield the ordinary Fourier series, as described in tlie fore¬ 
going sections. 

Orthogonal sets of functions arc obtained in jiractice by solving differen¬ 
tial equations, and this possibility will be discussed next On a given inter¬ 
val a < X < h consider the c(]uation 


d 

dx 



+ (l{i)y = Xr(x)2/, 


X == const, 


( 22 - 6 ) 


or, in abbreviated form, 

(py'Y 4- 92/ = Xrv, 


d 

Tx 


It will be convenient to require the additional condition 

f ry^ dx 0 


which, in particular, rules out the trivial solution y » 0. 
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Let ym be a solution when X has the value X^, and let ynhe a solution 

when X has a different value, Xn. Thus, 

(PJ/m)' + = Xmrj/m, (22-7) 

(py'nY + qVn- x„ry„. (22-8) 

If (22-7) is multiplied by yn and (22-8) by we get 

ynipl/mY - yrnipynY = Knrymyn ^ KrymPn (22-9) 


after subtracting the resulting expressions. Since 

IVnipym) - ymiVUn)]' = VnipymY + VniVlJm) ^ ymiplJnY 
— left side of (22-9), 

the foregoing result (22-9) may be written 
d 

fp(l/n.Vm 2/m?/n)] ^ O^m ^n)^ymyrf 

ax 


ymipVn) 


Integrating from a to h yields the fundamental formula 



when r is continuous. 

If the conditions at a and h are such that the left side of 
we can deduce 


( 22 - 10 ) 


(22-10) is zero. 


/: 


rUmlJn dx = 0, 


m 9 ^ n, 


(22-11) 


since X^ ^ X,,. The relation (22-11) may be written 

f i/,„)(Vr y„) tlx = 0, m 9 ^ n, 

Ja 

and hence the sequence dn{^) de(iTH‘d by 

On{x) = 'V'^i/n == \/'r{x}yuix) (22-12) 

satisfies the orthogonality criterion (22-1 j. An orthonormal set {<^„} may 
be obtained from {0nl as described previously. 


When r{x) is negative, the foregoing proc'ess does not yield a rail sequence {<?n(j") j, and 
it is better to work directly with (22-11). Functions ty„ satisfying (22-11) arc said to be 
orthogonal with respect to the wnqhiing funrtum r(x), the definition (22-1) corresponds to 
the case r m Fourier siTies basi'd on the mon* general conccjit of orthogonality (22-11) 
are quite analogous to those based on (22-1) (cf. Frob 2) 

Example 1 Show that the sequence (22-2) is orthogonal on the iuterv'ol ( —7r,7r). 
Since ain 7ix and cos nx satisfy « ~n^y, we may use the formula (22-10) with 
p f M 1, The result is 


iVnyln - VmVn) 


VmVndx, 

X—T 


(22-33) 
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where yn « sin m or cos nx and j/w « ain mx or cos mx. Since ynt/m PmPn has period 
2r, tlie value at r is the same as the value at ~ir, and hence the left side of (22-13) is 
zero. This yields the desired orthogonality except in the case m « w. If 7n «* n, how¬ 
ever, the relevant integral may be evaluated by inspection: 


i: 


cos riT sin nx dx 



sin 2nx dx 


0 . 


Example 2. Show that the Legendre functions Pn{x) are orthogonal on the inter- 
v»l (-1,1). 

Legendre’s equation (13-8) may be wn-itUm 

[(1 - x^)yr - Xy, 

where X is constant; X ~n(n -j- 1) when y ^ Pn{x). The special case p «« (1 — x^), 
^ 0, r « 1 in (22-0) and (22-10) yields 

(1 - x^KPnPL - PfnPn) f - i-rnirn + 1) + n(n + })\ f P,n{x}Pn{x) dx 
Ul J-i 

Since (1 — x^) vanishes at dbl, the left side is zero and the oithogonality follows. It can 
be shown ^ also that ^ 

f lP„(r)fdx = -4-:> 

y -1 2n *f 1 


and hence the corresponding orthonormal 8(*t is 

V>n(x) * (a -f ^2y''Pri{x) 

Exximple 3. )/(‘t the sequence pi, p2, • • • be ihv distinct positive roots of the equation 
J^(x) « 0, so that JftiPn) 0. If M > 0 the functions 


<t>n(x) 


^-.;.(pnx) 


are orthonormal on the interval (0,1). 

By (14-i) it is found that y =* Jn{px) satisfies 


JliPn) 


( 22 - 11 ) 


(^V'Y 




Xxy, 


and hence (22-10) holds with p ^ r ^ x. If w^e choose J,i(px) and pm « J^ipmx), 
the left side of (22-10) is 

I d d 1 

X! |^•/>^(px) fYyPmX) fiipmX) | > 


which reduces to J nip) pniJlipni), since Jn(pm) * 0. It follows that 


J- 


{"^Pm P^) [ XJ ft{ptr^)Jfiipx) dx « PmJft{p)J^{Pm) • 

Jo 


(22-15) 


*See E. J. Whittaker and O. N. Watson, “Modern Analysis/’ p. 305, Cambridge 
University Press, Loudon, 1952; J. M. MucUobert, “Hphencal Harmonics,” p. 92, Dover 
Publications, New York, 1948; W. E. Byerly, “Fourier Series and Spherical Harmonics,” 
p. 170, Ginn & Oompany, Boston, 1893. 
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Since J^ipn) *• 0, the choice p « p„ in (22-15) yields 

f X%JfiiPnX) dx =** 0, JTC 5?^ tl. ( 

Jo 

Moreover, differentiating (22-15) with respect to p we get 

2p f X*Tfi{pmX)J^{px) dx -|- (p^ p^) f X'Jn{pmX)Jf^{px) dx ■= PmJf^{p)Juifim), 

Jo Jo 

which reduces to ^ 

2f xlJ^(pmX)fdx ( 

Jo 


when p *« Pm« Equations (22-16) and (22-17) show that the sequence (22-1*1) is ortho¬ 
normal on (0,1), as desired. 

The fact that the equation J^(x) « 0 has infinitely many roots p„ is established in 
treatises on Hesstd functions; analysis of such questions for general differential equations 
constitutes the so-called Sturm-Liouville theory. It can be shoun that Fourier sera's of 
Bessel or l^egeiidre functions actually converge; that is, an analogue of Diriehlefs the¬ 
orem holds m such cases.^ These questions are treated, from a very g(*neral point of 
view, in a branch of analysis known as spectral theory. 


PROBLEMS 

1. By considering tht' equation y" =» \y show that the st‘qu<‘ncc .sin nirx ^7 is orthogonal 
on the interval (0,0, i^ud construct the corrc.'-pondiug oithononna) '^ot. 

2. Supfiose an arbitrary function f{x) i.s expanded in a urnforioly convergent series 
f{x) « '^CnynU), when* yn are the functions in (22-11) Show that 

r„ *= ( ^<r)f(x)yy^(i) dx^ r(x)(//^(j')P dx^ . 

Hint Multiply the giv<'n .s<'ties by ?(j)(/r,(j), and integrate term by tt'rm. 

3 . If m -f- n is positive, show that 

(m- ~ aO fx'-\rM)Jn{x) dx - lUrminO) - Jrn{l)JUl)l 
Jo 

Hint' Be.ssel's equation (11-1) may be written 

X 

{xy’} -b xy = ~ y, 

X ^ 

where X « rr when ]{ = JJx) To avoid difficulty at j = 0 one may consider J and 
let c ■“> 0. The convergence follows from (11-0), siiu*<‘ (11-0) gives 
./„,(j')Jft(j') (const) as X —» 0 

4. It can be shown that as 1/| —t oo, 


[J , 

/ TT 7nr\ 

Kir) - 


12 , / 

V nirX 

/— cos 1 
» rrx 

r 4 *” y) ’ 

^ ^ 


1 

i 

1 


* E, A. Coddington and N. L<‘vinson, “Theory of Oixlinary Differential Equations/^ 
chap. 7, McGraw-Hill Book C/ompany, Inc.. New Xork, 1955 
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By letting / -* « in Prob. 3, deduce 

r 2 r 

x'^^Jin{jr)Jn{x) dx “* ~ sill (m — n) ~ • m 4- n > 0. 

IT 2 


If <^n(x) are orthonormal on interval (0,1), show that 
iPnix) * a“'‘Vn(x/o) 


are orthonomial on the interval (0,a) 


23. The Mean Convergence of Fourier Series. If we try to approximate 
a function/(x) by another function PnCc), the quantity 

l/W - Pn(x)\ or [f{x) - Pn{j-)f (23-1) 


gives a measure of the error in the approximation. The sequence 7>n(.r) 
converges to/(x) vvhen(3ver the expressions (23-1) approach zero as n co. 

These measures of the error are appropriate for discussing convergiaice 
at an}^ fixed point x. But it is oftcm useful to have a measure of error which 
applies simultaneriusly to a whole interval of x values, a < x < b. Such 
a measure is casil^^ found if we integrate (23-1) from a to 

( !/(x) - p„(/)i dx or f lf(x) - p„ix)f dx. (23-2) 

Ja Ja 


These expressions are called the rnean ^ error and m(an-sqxiarc error^ respec¬ 
tively, If either expression (23-2) approaches zero as n —> go, we say that 
th(‘ seipience Pn{^) convi^rges m mean to /(x) and we speak of mtaa con- 
vergerwe. 

Even though (23-2) involwsan integration which is not presimt in (23-1), 
for Fourier series it is much easier to discuss the mean-s(]uar(^ error and the 
corresponding mean con\ergeiice than the ordinarx' convergence. Such a 
discussion is presented now. 

Let <^;,(x) be a set oi functions normal and orthogonal on a < x < b, so 
that, as in the last section, 


I ^rl('^ dx* 

•fa 


0 for rn 9 ^ n, 

1 for 7n = n. 


(23-3) 


We seek to approximate /(x) by a linear combination of 

Pn{x} = ai</>i(x) -{- a^<t>2(x) d - h a,A,(x), 

in such a way that the mean-8c|uare error (23-2) is minimum: ® 

* Note that if the exprt'ssions (23-2) are multiplied by 1/(6 — a), wc get prceisely the 
mean valuea of the coi responding expieshioiis (23-1). 

* We use / and <f>n as abbreviations for f(j) and <^»n(x), resfKU’tively. It Is assumed that 
/ and <t>n if'Tc integrable on a < x < 6. If the integrals are improper, the convergence of 

/•fc rb 

I dx and / <f>X dx is required, 

Jq. Ja 
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rb 

E (ai0i H-h an<t>n)? dx « min, (23*4) 


Upon expanding the bracket we see that (23-4) yields 

f f^dX'-2f f(ai<f>i 4-. • - 4- an(f>n) dx 4- f 4-h o,n4>n)^ dx. 

•fa •fa Jq 

If the Fourier coefficients of / relative to <t>k are denoted by 

Ck = f S<i>k dx, 

•fa 

then the second integral (23-5) is 


(23-5) 


rb 

I 4" • • * + «n<#>n) dx = aiCi 4“ (I 2 C 2 4" ‘ • + anCn- 

Ja 

The third integral (23-5) may be written 

rb 

I 4 “ • * • + 4 - * * • + <ln<i>n) dx 

Ja 

~ + • • * + + ‘ ‘ •) dx 

== af 4" ' * • + 

where the second group of terras involves cross products with 

i 7 ^ j. By (23-3) these terms integrate to zero, and the expression reduces 
to the value indicated. 

Hence, (23-5) yields 

E ^ f f dx - 2 ^2 ^kCk + ^ cii (23-6) 

^ k^i k^i 


for the mean-.square error in the approximation. Inasmuch as 
-2akCk + al ^ -cl+ {ak - Ckf^ 
the error E in (23-6) is also equal to 

E = [V dx - 'tcl + Ziak- (23-7) 

k^l k^l 

and we have established a theorem of central importance: 

Theorem I. If <f>n ^ of normal and orthogonal fnruiiom^ the mean- 
square error (23-4) may be written in the form (23-7), where Ck are the Fourier 
coefficients of f relative to 4>k- 

By going back and forth between the two express!tms (23-4) and (23-7), 
one obtains a number of interesting and significant theorems with tJhe 
greatest ease. In the first place, the terms {uk — Ck)^ in (23-7) are positive 
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unless Uk « Cky in which case they are zero. Hence the choice of ak that 
makes E minimum is obviously Ok = Ck, and we have the following: 
Corollary 1. The partial svms of the Fourier series 


Ci4>l + --h 

th 


Ck 


■■ I j4>kdT, 


give a smaller mean-square error I (/ — pn)^ dx than is given by any other 

•'a 

linear combination 

pfi “ ai<hi "f" • • ’ ~f~ Ofiihff 


Upon setting Uk — Ck in (23-7), we see that the minimum value of the 
error is 


min 



Tt 




(23-8) 


Now, the expression (23-4) shows that E > 0, )>eeause the integrand in 
(23-4), being a square, is not negative Since E > 0 for all choicTs of 
it is clear that the minimum of E (which arises when Uk = Ck) is also > 0 
The expression (23-8) yields, then, 


f f dx - ci > 0 or Yi d < [ f dx. 

Upon letting n oo we obtain ^ 

fb 

Corollary 2. If Ck — f4»k dx are the Fourier coefficients of f relative to the 

Ja 

orlhonormal set <i>n then the series wcf converges and satisfies the schcalled Bessel 
inequality 

QO ^ 

E d < f lf(x)l-dx. (23-9) 

k^l 

Because the general term of a convergent series must approach z(to 
(S ec. I, Theorem I) we deduce the following from Corollary 2; 

fb 

Corollary 3. The Fourier coeffiaents Cn~ ftp^ dx teml to zero as n oo. 

Ja 

For applications it is important to know whether or not the mean square 
error approaches zero as n oo. PJvidently the error approa('hes zero, for 
some choice of the uajS, only if the minimum error (23-8) does so. Letting 
n —> 00 in (23-8) we get the so-called Parseval equality 

f fdx - Z c* = 0 
as the condition for zero error: 


* Since c* > 0, the sequence 2 i® nondecreasing. We have just seen tliat it is 
bounded, and hence the limit exists by the fundamental principle (Sec, 1). 
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Corollary 4. Iff is approximated by the partial mms of its Fourier series^ 
the mearir-square error approaches zero a«s n —> oo if^ and only if^ BesseVs 
inequality (23-9) becomes ParsevaVs equality 

23 <=« = / dx. (23-10) 

..=1 

In other words, the Fourier series converges to / in the mean-sciuare 
sense if, and only if, (23-10) holds. If this happens for every choice of /, 
the set <#>n(^) is said to be closed, A closed set, then, is a set that can be 
used for mean-square approximation of arbitrary functions. It can be 
shown that the trigonometrical functions (*os nx and sin nx are closed on 
0 < T < 27r, though the proof is too long for inclusion here ^ 

A set 0n(j^) is said to be complete if there is no nontrivial function ^ 
f{x) which is orthogonal to all of the (t>n^- That is, the set is complete if 

Ck -1 fix)(t)ki^) dx - 0 for A; = 1, 2, 3, ..(23-11) 

implies that 

fW)]' dx = 0. (23-12) 

Ja 

Now, whenever (23-10) holds, (23-11) yields (23-12) at once. Hence we 
have: 

Corollary 5. Every closed set 4>n(x) is complete. 

The oouverse m also true Every complete set is dosed This converse, nowever, requires 
a more general integral than that of liiemaiin The goneialized integral is known as 
the Ijebesgue integral; it was first constructisl to d<avl vsitli this very pioblem. A brief 
description of the Lebesguc iiit(‘gra] is given in Apiiemhx (' 

The notions of closure ami compld( russ hav<‘ simple analogues in the elementary the¬ 
ory of vectors Thus, a set of vectois Vi, V2, V3 is said to be closed if every m'tor V can 
be WTitten m the form 

V - ciVi + rsVs + QV3 


for some choice of the constants a The set of vectors Vi, V 2 , V 3 is said to be complete 
if there is no nontnvial vector oitlK>g()nal I 0 all of them That is, the set is complete if 
the condition 

V-Vfc « 0 for A: = 1, 2, 3 

implies V*V »» 0. 

In this setting, it is obvious that closure and completeness are equivalent, for both 
conditions simply state that the three vectors Vj, V 2 , V 3 are not coplanar. These matters 
are taken up more fully in (^hap. 4. 


1 See E. 0. Titchmarsh, “The Theory of Functions,” p. 414, Oxford University Press, 
London, 1050 

* In the theory of mean convergence /(x) is regarded as trivial if /(x) « 0 for so many 


values of x that 



dx 


0 . 
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PEOBLEMS 

1 . (a) Show that Parseval’s equality takes the form 

1 A to 

- I (/(*)]*'ir-5ao+Z(«S + i^i) 

T Jo 2 ,_i 

when 4>nM are the trigonometric functions on (0,2ir). (b) Specialize to sine and cosine 
series on ( 0 ,Tr). 

2. It is desired to approximate 1 by 

p(x) *« ai sin x -f* 02 sin 2 ^ -f* ^3 sin 3 j 

in such a way thatfl — p{x)f dz is minimum. How should the coefficients a, be 
determined? ^ 

3. Give a direct proof that as n —> oo, 

/‘tr . 2 Jr 

I f{x) sin ax dr 0 , / f(x) cos nx dx 0 , 

Jq Jt) 

if fix) is i>eriodic of period 2?r and has a continuous derivative /'(x). Hmt: Integrate 
by parts, 

4. Obtain the formula ak ** fk from (23-4) by using the fact that dE/dak »= 0 at the 
minimum value of E, 

24* The Pointwise Convergence of Fourier Series. We shall now obtain 
an explicit formula for the difference betw^een a function and the nth partial 
sum of its (trigonometric) Fourier series. The formula will enable us to 
establish the convergence for a class of functions which includes all the 
examples given in this book. 

If f(x) is a bounded integrable function of period 27r, the nth partial 
sum of its Fourier scries is 

1 ” 

~ Uo + s (at cos kx + 6jfc sin kx), (24-1) 

2 

where the coefficients are given by 

Uk — - f f{t) cos kt dt, bk - - f f{t) sin kt dt, (24-2) 

TT TT •'•-V 


Substituting (24-2) into the series (24-1) we get 
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If we define the so-called Dirichlet kernel by 

1 n 

I>n{u) ^ COS kUf 

2 

the foregoing result takes the simpler form 

Sn{x) -if f{l)Dn(i - X) dL 

TT ’'—r 

Setting t — X = u in (24-4) yields 

1 rir—T 

Sn{x) = ~ / /(/ + du. 

TC -'-“Jr—r 
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(24-3) 

(24-4) 

(24-5) 


Now, Dn{u) has period 27r by inspection of (24-3), and /(r) also has period 
27r. Hence, the integral of/(u + x)J)„(u) over any interval of length 27r is 
the same as the integral over any oilaa' int(‘rval of lenglh 2ir, and (24-5) 
may be replaced by 

1 rx 

^n{x) = - / fU + ^i)Dn(n) du. (24-0) 

Since Z)„(-u) = Dn(u) t)y (24-3), we may replace u by —u in (24-6) to 
obtain the alternative form 

I rw 

Sn{s) = - /(.r - u)Dn{u) du. v‘^4-7) 

TT X 

The sum of (24-6) and (21-7) yields 

2^n(x) = - r !/(^ +u)+ f(l - lO]Du[u) du. 

TT 

Since the integrand is an even function of a, tin* integral from 0 to tt is half 
the integral from — tt to tt, and we have thus estebhshed that 

1 /-x 

= - / lf{x + u) +f{x - n}]Dn(u) du. (24-8) 

TT •'C 

To introduce/(x) into our considerations, we observe that 

- fOniv) du, (24-9) 

2 TT •'C 

since the terms involving cos/cw in (24-3) integrate to zero. If (24-9) is 
multiplied by 2/(x) (which is constant \\ith respect to the integration var¬ 
iable r), we get 

f{x) = - / 2fix)Dn{u) du, 

IT •'0 


(24-10) 
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Subtracting (24-10) from (24-8) gives the fundamental formula 

s„(x) - f{j) = ~ f^lfix + u)- 2f{x) + fix - u)]Dn{u) du, (24-11) 

which will now be used to study the convergence of Sn{x) to f(x). 

We sliail say that f(x) is piecewise smooth if the graph of f{x) consists of 
a finite number of curves on each of which /'(x) exists. We suppose also 
that the derivative exists at the end points of these curves, in the sense 


f{x ti) - f{x+) 

lim —- 

u —» 0 + u 


or 



fi x - u) - fi x-) 


(24-12) 


where “w —> 0+^’means?/ 0 through positive values. Such a function 
may have finitely many discontinuities. However, since the Fourier co- 
eflRcients of J{x) are not altered if /(x) is redefined at a finite number of 
points, we can assume that 


/(*r) = 


/(^+)+/(x-) 

2 


(21-13) 


at every point x, whether/(x) is continuous at x or not. 

These preliminaries lead to the following theorem: 

Theorem. If /(x) is periodic of period 2t, is piecewise smooth, and is 
defined at points of discontinuity by (24-13), then the Fourier series for /(.r) 
converges to f{x) at every value of x. 

To establish this theorem we recall that the series (24-3) was summed in 
Sec, 17, Example 1. The result (17-5) yields 


DrXu) 


sin {n •+• y^)u 
2 sin }x 2 U 


(21-11) 


If we substitute this into (24-11) and replace 2/(x) by /(x+) +/(x~-) in 
accordance with (24-13), we get 


«n(x) - f(x) 


1 /•» f(x -h u) - /(x-i-) -f- /(Z - U) - f{x-) . 


T •'0 


Sin 


2 sin 34^ 

Now, the expression 

f{x + u) - fix+) _ f(x -f u) - f{x+) (m/2) 

2 sin u 

has a limit as u —> 0-hj since 


+ 2) ^ 


sin {u/2) 


(24-15) 


u/2 
lim 


1 


sin iu/2) 


as 0 
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and since the limits (24-12) exist by hypothesis. If we define the value of 
(24-15) at w «= 0 to be this limit, then the expression is continuous for 
R > 0 as long as the points 

[x,/(a:4’)] and [x + u, f(x + tO] 

are on the same smooth curve belonging to the graph of f{x). 

On the other hand, for 0 the function (24-15) is just as well behaved 
as the numerator/(x + w) - /(.r+), since sin does not vanish. This 
shows that the graph of (24-15) consists of a finite number of continuous 
curves, which have finite limits as one approaches their end points and 
hence are bounded. 

It follows from Corollary 3 of the preceiling section ^ that 



/(j -f u) - f{x+) 
2 sin 



udu ~ 0 . 


In just the same way it is found that 


lim 

n ~+ 


£ 


•^/(X - H) 


2 sin ^ 2^1 


Six-) 

-- sm 


in u du = 0 


and hence th(^ integral reiiresenting ,Sn(x) — /(x) tends to zero as n 
This shows that 


lim .s„(x) = /(x) 


n "-+<*• 


00 . 


and c<»nipleles the proof of the theorem. 

26. The Integration and Differentiation of Fourier Series. If /(x) is 
piecewise conlinuous* on then the function 

Fix) = r fit) di (25-1) 

J —TT 

is continuous and piecewdse smoolh (Sec. 24). Moreover, F{x) remains 
continuous when defined to have period 27r, provided F(~Tr) = F{w), 
Since F(~ 7 r) = 0, the latter condition reduces to 

Fir) = £y(0 dt = irao = 0 (25-2) 

where is Ihe first Fourier coefficient of fix). Applying the theorem of 
‘ The presence of the H in sin (n + causes no trouble, since 
sin (n 4* ^ i)u » sin nu cos Hu 4* cos nu gin 
and Corollary 3 applii's to each term. 

•This means that the interval (~7r,irl can be divided by points xi, 22 , ..Xn into a 
finite number of intervals on each of which f{x) is continuous. Also f{x) must have a 
limit as a: —» Xfc-f and as x Xk — . 
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the preceding section, we can now deduce that the Fourier series for the 
periodic function F{x) converges to F{z) at every value of x. 

This result can be obtained when f(x) and \f(x) | are only assumed integrable, without 
being piecewise continuous. Indeed, we can always write f(x) P(x) — N{x) where 
P(x) is positive and —N(x) is negative. The equation 

F(x) « r P(0 di - r N(t) dt 
J —r J —»■ 

expresses F(x) as the difference of two increasing continuous functions. Since such 
functions satisfy the Dirichlet conditions, the desired result can be deduced from Diri- 
chlet's theorem as quoted in Sec. 18. 

We shall show next that the Fourier series for F(x) is obtained by inte¬ 
grating the series for f(x). If n > 1, the Fourier cosine coefficient An of 
F(x) satisfies 

pf sin 'i 

vAn ^ I F(x) cos nxdx ^ F(x) - 

T n 

when we integrate by parts. Since F(—t) ~ F(7r) « 0, the integrated 
part drops out, and since F^{x) « /(r), the expression becomes 

1 fT hn 

vAn —-/ sin nr f(x) dx » —w — • 

n n 

In the same way Bn — a„/n, and also 

1 fv 

-4o “-/ x/(a-) dx. (25-3) 

w 


rr sin nx 

- / - F'ix)dx 

J —IT n. 


These considerations establish the following remarkable theorem: 

Theorem L Lei f{x) be a function of period 27 r which has a Fourier series 

S(an cos nx + sin nx). (25-4) 

TheUj with .4o given by (25-3), 


f /(O 

J —r 


dt 


1 


' i4o + 2 


\n 


sm nx -cos nx 

n 


)■ 


(25-5) 


and this equation holds for all x, even if the Fourier series (25-4) does not 
converge. Moreover^ (he series (25-5) is actually the Fourier series of the 
function on the left. 

In case % 9 ^ 0, so that the Fourier series for f(x) is 
^ao + 2(a„ cos nx + bn sin nx), 
we apply Theorem I to f(x) — 3^ao. Inasmuch as 

/ /(X) dx « r fix) dx - r fix) dx « Fifi) - Fia) 
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for all a and 0, the reader may deduce, by Theorem I, that 
rfi f 0 

I f(x) dx = / (J^oo) dx + S / (o„ cos nx + K sin nx) dx. (25-6) 

Ja Ja Ja 

This result may be summarized as follows: 

Theorem II. Any Fourier series {whether convergent or not) can he m- 
tegrated term by term between any limits. The integrated series converges to 
the integral of the periodic function corresponding to the original series. 

For example, according to (18-5) the Fourier series for is 


:: a; “ sm a; 
2 


8m2x sinSx 

--j---j.. 




(26-7) 


If we integrate from a to x by Theorem II, we get 

^ / 2 ^ cos nx — cos na 

~ix^ - J) -» 2](-~l)"-j- 

4 «-i n* 

Treating o as constant, we see that 


4 ,^1 n* 

where C is constant. Since C is the first Fourier coefficient of 


^i^-C + LC-D" 

n«l 

1 first Foil 

1 1 
2 r 

25-7), we 


cos ?IX 

_ 


dx 


2ri^,4 12 

Alternatively, because oo 0 m (25-7), we can use (25-3) to obtain 

: X* dx 

and hence, by (25-5), 

' .=) 


I 


F(x) « 2 (X* 
4 


O n«l n* 


(25-8) 


(25-9) 


The consistency of this result with (25-8) and (25-9) is easily verified. 

Although Fourier series can always be integrated, as we have just seen, 
the differentiation of Fourier scries retjuires caution. For example, the 
series (25-7) converges for all x, and yet the series 

cos X — cos 2x + cos 3x — cos 4x H- 


obtained by differentiating (25-7) diverges for all x. The trouble is that 
the function y^x (when made periodic) has no derivative at the points rirw, 
=fc:3x, ±:5ir, - 

This example is quite typical of the general situation, which can be de¬ 
scribed as follows: There is not much hope of being able to differentiate a 
Fourier series^ unless the periodic function generating the series has a deriva¬ 
tive at every valu£ of x. On the other hand, when this condition is fulfilled, 
we usually can differentiate, as is shown by the following theorem; 
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Thesorem III. Let f{x) have period 2ir, and suppose f{x) exists for every 
value of Xy without exception. If f{x) is continuouSy^ the Fourier series for 
f(x) can be obtained by differentiating the Fourier series for fix). If fix) 
is continuous and has only a finite number of maxima and minima on [ — ■n'^Tr], 
the differentiated scries actually converges to fix) for every x. 

Repeated application of the theorem gives the corresponding result for 
higher derivatives. For instance, the series for/"(x) can be found by dif¬ 
ferentiating the series for fix) tAvice, provided /"(x) satisfies the conditions 
of the theorem. 

We shall establish Theorem III by applying Theorem I to the function 
fix). Being continuous, fix) has a Fourier series, and the constant term 
ao can be found from 

TOO = f fix) dx - fin) - fi-n) = 0. 

Thus, the series for/'(x) has the form (25-4), namely, 

S(an cos nx + sin r?x). (25-10) 

It follows from Theorem I that the Fourier series for the function 

r fit) dt^ fix) -fi-n) 

J 

has the form (25-5), and hence the series for/(x) is 

/( — it) + - ii() + S sin nx -^ cos nxV (25-11) 

2 \n n / 

By inspection, we see that differentiating (25-11) gives (25-10). In other 
words, the Fourier series for fix) can be found by differentiating the series 
for fix)y and this is the main assertion in Theorem III. 

Since the differentiated series is a Fourier series, its convergence can be 
tested by the usual methods. In particular, if fix) satisfies the Dirichlet 
conditions and is continuous, then the Fourier series for fix) converges to 
fix). Thus, Theorem III is established. 

The foregoing methods lead to some important inequalities for the 
Fourier coefficients. Wffien a function fix) satisfies the Dirichlet condi¬ 
tions, it can be sho^Nm ^ that the Fourier coefficients have the order of mag¬ 
nitude 1/n. That is, there is a constant M depending on fix) but not on n 

* It can be shown that satisfies the conditions of Dirichlet, then f'ix) is neces¬ 

sarily continuous. This follows from Darboux’s theorem. See, for example, L. Brand, 
“Advanced Calculus/^ p. 112, John Wiley Sons, Inc., New York, 1955. 

*See 1. 8. Sokolnikoff, “Advanced Calculus,” p. 406, McGraw-Hill Book Company, 
Inc., New York, 1939, Cf. also Prob. 4. 



SKC. 25] 
such that 


ADDITIONAL TOPICS IN FOURIER SERIES 


211 


\cLn 



\hn\ 



(25-12) 


Now, if the Fourier coefficients of fix) in (25-10) satisfy these conditions, 
then (25“11) shows that the coefficients of /(j) are bounded by Mfn^. 
More generally, we can start with and integrate fr times. The con¬ 

stants of integration drop out as in the derivation of (25-10), and we 
obtain: 

Theorem IV. Let f{x) have period 2ir and suppose the kih derivative of 
f{x) satisfies the conditions of Dirichlet on [~-T, 7 r]. Then the Fourier coeffi¬ 
cients of f{x) satisfy the inequalities 


M 

"iTfT’ 


\hn\ < 


M 

1 


where the constant M depends on f{x) but not on n. 


PROBLEMS 


1. By integrating the series (25-8) from 0 to j deduce that 

sin 


« 12 S (-I)”- 


n«.i rr 

2. By integrating the series m Prob. 1 from —v to x deduce that 


1 

48 


(T^ - 


90 




3. Show that the following is not a Fourier series: 

^ sin nx 
n^i log (1 -h n) 

Hint If it is a Fourier series, the integrated senes must converge for all x. 

4. Deduce (25-12) v\lu‘n/(j) is piecewise smooth on [--;r,ir). Hint: I^et the points 
Xk divide [ — into a finite number of intervals on each of which/'(x) is continuous. 
The Fourier <*oeffi(’ients are obtained by abiding integrals of the type 

f V(^) ^*<58 nx dx or f ^^fix) sin nx dx, 
and these can be integrated by parte. 
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Tlie considerations of the preceding chapters were confined primarily 
to functions y = f{x) of a single independent \'ariable x. One does not 
have to go far to encounter functional relationships depending on two or 
more independent variables. In courses in analytic geometry and calculus 
the reader has learned that a fun<*tional relationsliip of the form z = f(x,y) 
may be represented as a surface, and lie has made use of partial derivatives 
to study some proi)erties of surfae(‘s. In this chapter tlie familiar concepts 
underlying the study of real functions of two variables are sharpened and 
extended to functions of many variables The bearing of such extensions 
on the (*alculation of rates of change and maximum and minimum values 
of functions of several variables is indicattnl in numerous problems of 
practical interest. 

llie concluding sections of the chapttT deal with integrals of functions 
of several variables They contain an introduction to the calculus of 
variations - a subject of great importance m pliysics and technology. 
Many situations can be i‘liara<‘terized }>y statements to the effect that 
certaui integrals attain extreme values. Th(‘ determination of such ex¬ 
tremes is in the province of calculus of variations. 


THE TECHNIQUE OF DIFFERENTIATION 

1. Basic Notions. Let z - /U,y) bo a real-valued function of two inde* 
pendent variables (x,y). We can think of (x,y) as the coordinates of a 
point in the xy plane and interpret z as the height of the surface defined 
by = f(x,y). The function fix,y) may be determined for every point 
{x,y) in the xy plane, or the points for which it is determined may occupy 
a certain region R in that plane. 

For example, 

z^x^ + y^ (M) 

represents the paraboloid of revolution for every pair of values while 

z = (1-2) 

217 
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represents the surface of the hemisphere only for those values of (j,a/) 
for which < \, In the example ( 1 - 1 ) the region R is the entire 

xy plane, while in ( 1 - 2 ) it is the interior and Die boundary of the unit 
circle » 1 . The function 


1 


(1-3) 


is defined in the circular region < \, but not on the boundary 

^ 1. If the region of definition of the function includes its 
boundary 6 \ we sliall say that the function is ( 1(^1 ined in the closed region R. 
When the boundarj^ of the region R is not included, the region is said to 
be open. 

To define the continuity of 2 = f(x,y) at a given point we need the 
notion of the neighborhood of that point. The neighborhood of Pixo^yo) 
is the set of all points P(x,y) interior to a circle with center at (xo,yo). 
If the radius of this circle is 6 > 0 , then the neighborhood of (x(),zA)) is 
a circular region (.r — xo)'^ ^ (y ^ < 52 , Xhe positive number 6 

can be chosen arbitrarily small. The extension of this definition to si)a(‘os 
of more than two dimensions is immediate. The neighborhood of 
Pixo.yoZo) is the open spherical region 


(j - Xof + (y - yuf + {z - Zof < 


The neighborhood of the point PoiX(),yo,Zodo) in the space of four variables 
X, y, z, th the set of ^‘points^^ {x,y,zd) such that 

(x — xaf + (y ~ i/o)^ + (2 2 : 0 )^ + {t — < 5^, 

and so on for spaces of higher dimensionality. 

Intuitively the notion of continuity of 2 ^ f(Xjy) at a given point 
^o(*ro,?yo) means that the value of f{x,y) throughout a neighborhood of 
(to,2/o) will differ from/(xo,2/o) l>y as little as desired if the neighborhood is 
chosen sufficiently small. In symbols this means that if one specifies a 
positive number «, no matter iiow small, then for all points in a certain 
circular region (x — xo)^ + ( 2 / ~ yof < we have 

\fix,y) - /(xo,2/o) 1 < «. (1-4) 

An alternative notation for (1-4) is 

Um f(x,y) = f(xo,yo), (1-5) 

ix.v) 

which states that as the point {x,y) is made to approach (xo,j/o), the value 
of the limit is equal to the value of the function at (xo,j/o)- 
We extend this definition (1-5) to functions/(xi,X 2 ,. . .,x„) of n variables 
in the obvious way; A function /(xi^ 2 ,- ■ - jXn) is continuous at the point 
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Po{xi,X 2 ,.. .,*2) whenever 

lim fixi,X 2 ,...,®») = fix'l^xl ,... ,xl). 
p-p. 

The “point P'* here means the set of n real numbers (xi,X 2 ,... ,Xn). Clearly, 
/(xi,X 2 ,... ,.rn) cannot be continuous at (xi,x 2 ,... ,Jn) if it is not defined at 
that point. 

Whenever /(xi,X 2 ,.. .,Tn) is continuous at every point P of the given 
region /?, it is said to be continuous in the region R. Functions with which 
we shall deal for the most part will be continuous in some region, open or 
closed. 

PROBLEM 

Describe the regions of definition and the surfaces defined by the following functions z: 

(a) Jf - ?/ -f 2 « 1; (6) 2 « 7/; 

(c) 4. ^2 « 25; (d) 2 « 1 /(.r^ -f y^); 

(e) z ^ \/x\ {/) 2 «= Vl - (j - 1)2 - 7/2. 

2. Partial Derivatives. Let u — /(x,y) be a function of two independent 
variables j, y, and let it be defined at a i)oint (.T(),yo) and for all values of 
(jjy) in some neighborhood of (xo,i/o). If y is set ecpial to ,Voi then u be¬ 
comes a function of one variable x, namely, 

u = /(x,i/o)* 

If this function has a derivative with respect to x, the derivative is 
called the partial derivaiwe of f{x,y) with respect to x for y == y^. In like 
manner, if x is asvsigned a constant value x«), the derivative with respect 
to y of the resulting function f{x(),y) is called the partial clerwativc of f(x,y) 
with respect to ij for x = Xo. The customary notations for the partial 
derivative of w = f(x^y) with respect to x arc 

da df 

’ S-f} and ’ 
dx dx 

The partial derivatives of a function /(xi,X 2 ,... ,Xn) of n independent 
variables are obtaiiHvJ by fixing in it the values of n - 1 variables and 
computing the derivative of the resulting function of a single variable. 
Thus, 

f{x,y) = - 2yx (2-1) 


has the partial derivatives 
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If « » J{x,y) is a function of two independent variables, it is easy to 
provide a simple geometric interpretation of partial derivatives w* and Uy. 
The equation u = f{x,y) is the equation of a surface (see Fig. 1). If x 



Fio. 1 


is given a fixed value Xq, u =f(xo,y) is the equation of the curve AB on 
the surface formed by the intersr^etion of the surface and th<‘ plane x = Xq. 
Then 

^ dy A|/ 0 Ay 

is the slope at any point of AB. Similarly, if y is assigned the constant 
value 2/or then u — /(x,?/()) is the equation of the curve CD on tlie surface 
and 

f(xn + At, ?/<,) - /(ro,?yo) 

Ux ^ — = lira--— 

dx Ax -► 0 Ax 


is the slope at any point of CD. 

In Chap. 5 we shall sec that the partial derivatives Uy, of u = 
can be interpreted as rectangular components of a certain vector, 
called the gradient of u. This vector provides a measure of the space 
rate of change of u. 

The partial derivatives /x„ of /(xi,x 2 ,.. .,Xn) arc functions 

of Xi, X 2 , .. Xn, and they may have partial derivatives with respc^ct to 
some or all of these variables, Tliese derivatives are called second partial 
derivatives of /(xi,J 2 # • • If there are only two independent variables, 

/(x,y) may have the second partial derivatives 


dx \dx/ dx^ 
dx \dy/ dx dy 
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It. should be noted that /a:„ means that df/dx is first found and then 
is determined, so that the subscripts indicate the order in 
which the derivatives are computed. In 

dy dx dy \dx/ 

the order is in keeping with the meaning of the symbol, so that the order 
appears as the reverse of the order in which Ihe derivatives are taken. 

For the function f{x,y) in (2-1) we get, on noting (2-2j, 

d /df\ a 

~ ^ ^ ■“ ^y) “ 2 j * — 2 , 

dy \dx/ dy 

^ /dA d „ 

fyt = “('■) = ~ - 2j) = 2x - 2 

dx \dy/ dx 

fuv ~ ^ 

dy 

d 

fxx = -7 ( 2 .r?/ - 2y) = 2y. 
dx 

In this exampleand indeed, one rarely meets functions for which 
the so-called mixed derivatives ar<' unequal. In fact one can prove ^ that 

_dV ^ _d^ 

dx dy dy dx 

W’henover those derivatives are continuous at the point in question. 

The pro(*ess of defining partial derivatives of Jiigher orders is obvious 
from the foregoing, and it is [lossibie to establisli e(|ualities such as 
^ fxxy “ fyxx Rud /yzj/ = Jxyy ^ fyyj wheHCver tlipse dcrivativps are con¬ 
tinuous at the point in question. 

We note in eonelusion that although the notation du/dx lor tlie partial 
derivative Vx suggests a quotient of some quantities analogous to t)ie dif¬ 
ferentials dy and dx in the notation dy/dx for the derivativf^ of y = f(x), 
no such interpretation is availaide for partial derivatives. To stress the 
point that du/dx should never be tliought of as a fraction, we give an 
example. 

Emniph\ Consider tlie (‘(nuition for an idi'al gas pv * RT, whf*ie p ife the pressure, 
V IS tt\e volume, T is the absolute temperatuus and A' is a physical eonstant. It should 
l)e noUni first that the concept of partial <leriva(iv(‘vs hinges on tlie agicement as to which 
variables in a given functional relationship are assumed to be independent. Thus, if 

* Boe 1. S. Sokolriikoff, ‘‘Advanced Calculus,’' see. 31, McGraw-Hill Book Company, 
Inc., New York, 1939. 
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we solve our gas equation for p, we obtain 
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We can then compute 


dt; * 


R 


On the other hand, if we solve for w, we get 


in which p and T are now regarded as the independent variables, and we can, therefore, 

av R av RT 

ar “ p’ ap “ p* 

We can also solve for T and get 


in which p and v are to be considered as the independent variables, so that 

dT V dT p 

dp R dv H ^ ^ 

From Eqs. ( 2 - 3 ) to ( 2 - 5 ) we obtain 

dv dT dp p R * 

since pv ^ RT, But if it were possible to treat the terms in the left-hand member of 
(2-6) as fractions, we should have obtained -fl. 


PROBLEMS 

1 . Find dz/dx and dz/dy for each of the following functions* 

(a) z *» y/x‘, (h) z » -|- tan“* (j//a?); (c) z *= sin xy + j; (d) z « log y; 

(e) z »» x^y -h sin”^ x 

2 . Find du/dx, du/dy, and Ou/dz for each of the following functions: 

(a) u *» x^y 4-1/2 — xz^; (h) u « xyz + log xy\ 

(c) a «» z sin~^ (x/V); (d) w •* (r® -b y* -h 
[e) a « (x^ + i/ + 

d. Verify that d^f/dx dy ** d^f/dy dx for 
(a) / ** coa x//^, (6) / » sin^ x cos y, (c) / « 

4 . Prove that if 

(a) f{x,y) « log (x* -f V*) + tan~^ tlien ^ -f «* 0; 

X dxr djT 

Q>) f{x,y,z) « (x* 4-1/® -f 2^)““'^, then ^ *“ 

dx* dy* dz* 
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3. Total Differentials. The differential dy of a function y » S{x) is 
defined by the formula 

dy = fix) dx, (3-1) 

where dx s Ax is an arbitrary increment of the independent variable x. 
We agree to call an increment of the independent variable x the differential 
of x. 

Since (Fig. 2) 



/(X-f Ax) -/(X) 

/ (x) == hm iim ~ “—--- 

Ax —♦ 0 Ax Ax —0 Ax 

we can write, on recalling the definition of th(‘ limit, 

-/ =/'(x) + €, 

Ax 


(3-2) 


(3-3) 


where lirn € = 0. Hence 

Ax 0 

Ay f\x) Ax -b t Ax. 
The substitution from (,3-1) in (3-4) then yields 

Ay ^ dy + t Ax, 
lim € = 0 as Ax 0 . 


(3-4) 


(3-5) 


Figure 2 illustrates geometrically the relations between Ay, dy, and dx, 
and formula (3-5) sliows that for small values of Ax, the increment Ay 
is a good approximation to the differential dy in the sense that 


Ay - dy 


Ax 


(3-6) 


where e —> 0 as Ax —► 0. 

One can construct a similar approximation to the increment An for 
the function u = /(x,y) when x and y are allowed to acquire the respective 
increments Ax and Ay. 
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The presentation of the essential ideas in this coiietruetion is greatly 
simplified by the use of the mean-value theorem of the differential calculus. 
This theorem states that whenever f(x) has the derivative f{x) at every 
point of the interval (r, x + Ax), then 

f(x + Ax) — /(x) 

=./'({), (3-7) 

Ax 

where f is an intermediate point in the interval. The geometric meaning 
of this theorem is exceedingly simple. Formula (3-7) states that the sloi>e 

/(x + Ax) - /(x) 

Ax 

of the secant line AB (Fig. 3) is equal to the slope f\0 of the tangent 



line CD to the curve y = /(x) at an intermediate point f in the interval. 
Since ^ = x + Ax, where B Ax is some fraction of the length Ax, we 
can write (3-7) as 

/(x + Ax) - /(x) - /'(X + ^ Ax) Ax, 0 < ^ < 1. (3-8) 

Consider now a function u = f{x,y) of two variables. The increment 
Aw that results from replacing x by x -f Ax and y by y -f Ay is 

Aw fix + AXyy + At/) - f{x,y), (3-9) 

If we add and subtract /(x, y + Ay) in the right-hand member of (3-9), 
we obtain 

Aw [/(x + Ax, j/ -f Ay) - /(x, y -f Ay)] + [/(x, J/ + Aj/) - /(x,|/)]. 

(3-10) 
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Tbe expression in the first pair of brackets in (3-10) is the increment in 
the function/(x, 2 /) when the second variable in it has a fixed value y + A;/, 
Accordingly, we can apply formula (3-8) to it and write 

f(x + Ax, 2/ + A2/) y + ^y) ^ /x(x + Ax, 2 / + A^) Ax, (3-11) 

where 0 < < 1. 

Similarly, the application of (3-8) to the expression in the second set 
of brackets in (3-10), in which x has a fixed value, yields 


/(x, y + Ay) - f(T^y) = ^(.r, y + $2 Ay) Ay, 


(3-12) 


where 0 < ^2 < 1 * 

Now' if the partial derivatives/^(x,?/) and/j,(x,//) are continuous functions, 
then 

f^(x + 6iAx,yi~ Ay) = fjx,y) + ei, 

, (343) 

fy(x, y + $2 Ay) - fy(x,y) + 62, 

where lim = 0 and lim €2 = 0 as Ax and Ay approach zero. Hence we 
can write (3-11) and (3-12) in the forms 

/(x + Ax, t/ -f- Ay) - /(x, y + Ay) ^ [fx{x,y) + € 1 ] Ax, 

/(x, y + Ay) - f{x,y) = [fy{x,y) + ^ 2 ] Ay, 

so that (3-10) becomes 

A?^ = fx(x,y) Ax +fy{x,y) Ay + ti Ax + €2 Ay. (3-14) 

If we define the differential du of u = /(x,y) by the formula 

du Ax+/yAi/ (3-15) 

d/ df 

s — Ax H- Ay 

dx dy 

we can write (3-14) in a form analogous to (3-5): 

Aw ~ du + Cl Ax + 62 Ay, 

lim Cl = 0, Urn € 2—0 as Ax 0 and Ay —> 0. 

Formula (3-16) shows that when the increments Ax and Ay are small, the 
differential da is a good approximation to Au in the sense that 


(3-16) 


Au — du 


Cl Ax + €2 A|/ 


•v/(A2p + 


0 


as Ax and A^ approach zero. 

As in the case of functions of one independent variable, we agree to 
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write the increments Ax and Ay in the independent variables as dx and dy, 
respectively. Then (3-15) reads 

df df 

du - —+ —dy. (3-17) 

dx dy 

Whenever (3-16) holds, the function u ~ /(x^y) is said to be differentiable, 
and du in (3-17) is called the total differential} A function which is dif¬ 
ferentiable at each point of a region is said to be differentiable in the region. 
The foregoing discussion shows that a function f{x,y) is differentiable 
whenever the partial derivatives Jx and Jy are continuous. 

The foregoing considerations can be extended to functions u = f{xi,X 2 , 

... ,x„) of n independent variables. The total differential du is given 
by the formula 

df df df 

du == — dxi -j- dx 2 + * • • H- dxn (3-18) 

dxi dx2 dXfi 

whenever the partial derivatives fx^ are continuous functions. 

It should be noted that the total differential du is ecjual to the sum of 
n terms involving independent increments dxt When a number of small 
changes are taking place simultaneously in a system, each one proceeds 
as if it w^ere independent of the others, and the total change is the sum of 
the effects due to the independent changes. Physically, this corresiK)ri<lh 
to the principle of superposition of effects. 

Example 1. Find the total differential of a « e^yz^. Since Ux, Uy, Ut are obviously 
continuous functions, formula (3-18) yields 

du « dx -f dy -h 2c^yt dz. 

Example 2. A metal box without a top has inside dimensions 6 by 4 by 2 ft. If the 
metal is 0.1 ft thick, find the actual volume of the metal used and compare it with the 
approximate volume found by using the differential. 

The actual volume is A V, where 

aF » 0.2 X 4,2 X 2.1 ~ 6 X 4 X 2 « 54.684 - 48 « 6.684 ft>. 

Since V *» zyz, w here z « 6, y « 4, z « 2, 

dV yz dx -b xz dy 'jr xy dz 

« 8(0.2) -f 12(0.2) 4- 24(0.1) « 6.4 ft*. 

* In Chap. 5 we shall encounter expressiona of the form (3-17) in which fx and fp be¬ 
come discontinuous at certain points of the region and u is a multiple-valued function. 
Such expressions are generally called exact differentials, and they are also denoted by the 
same symbol du. For technical reasons, explained in Chap. 5, it is usually necessary to 
assume the continuity of fx and fy, in which event the terms exact and total differentials 
become synoi^ymous. A geometric meaning of the differential (3-17) is given in Sec, 
10, Chap. 4. 
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Example 3. Two sides of a triangular piece of land (Fig. 4) are measured as ICO and 
125 ft, and the included angle is measunKi as 60°. If the possible errors are 0.2 ft in 
measuring the sides and 1 ° in measuring the angle, what is the approximate error in the 
area? 



Fuj. 4 

Since A » }^xy sin a, 

dA ** }/i(y sin a dx x sin a dy xy cos a da), 
and the approximate error is therefore 

A 

125 (--) (0.2) + 100 (-i) (0.2) + 100(125) (i) - 74.0 ftl 


PROBLEMS 

1. A closed cylindrical tank is 4 ft high and 2 ft in diameter (inside dimensions). 
What IS the approximate amount of metal m the wall and the ends of the tank if they 
are 0,2 in. thick? 

2. The angle of elevation of the top of a tower is found to be 30°, with a possible error 
of 0.5°, The distance to the base of the tower us found to be 1,000 ft, with a possible 
error of 0.1 ft. What is the possible error in the height of the tower as computed from 
these measurements? 

3. What is the possible error m the lengtli of the hyi>otenuse of a right triangle if the 
legs are found to be 11 5 and 7.8 ft, with a possible eiror of 0.1 ft in each meafiurement? 

4. The constant C m Boyle’s law pv = C is calculab^d from the measurements of p 
and V. If p is found to be 5,000 lb per with a possible error of 1 per cent and v is 
found to be 15 ft^with a possible error of 2 per cent, find the approximate possibleerror in 
C computed from thiuse measurement*. 

6. The volume v, pres.sure p, and absolute tempt*rature T of a perfect gas are con¬ 
nected by the formula pv « R7\ where R is a constant. If 7" « 500°, p «= 4,000 lb per 
ft''*, and e »* 15 2 ft^, find the approximate change in p when T changes to 503° and u 
to 15.25 ftl 

6. In estimating the cost of a pile of bricks measured as 6 by 50 by 4 ft, the tape is 
stretched 1 per cent beyond the estimated length If the count is 12 bricks to 1 and 
bricks cost $20 per thousand, find the approximate error in cost. 

7. In determining si)e<‘ific gravity by the formula ,•? A/(A — W), where A is the 
weight in air and W is the weight in wattT, A can be read within 0.01 lb and TF within 
0,02 lb. Find approximately the maximum error in s if the readings are A » 1.1 lb 
and W 0.6 lb. Find the maximum relative error A«/s. 

8. The equation of a perfect gas is pv » RT, At a certain instant a given amount of 

gas has a volume of 16 ft® and is under a pressure of 36 psi. Assuming R * 10.71, find 
the temperature T. If the volume is increasing at the rate of } i cfs and the pressure ia 
decreasing at the rate H temperature is changing. 
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9. The f)eriod of a simple pendulum with small oscillations is 
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T *« 2ir\/~‘ 

If T is computed using Z =« 8 ft and g ^ Z2 ft per sec per sec, find the approximate 
error in T if the true values are I * 8.05 ft and g ** 32.01 ft per sec per sec. Find also 
the percentage error. 

10. The diametf^r and altitude of a can in the shape of a right circular cylinder are 
measiured as 4 and 6 in., respectively. The possible error in each measurement is 0.1 in. 
Find approximately the maximum possible error in the values computed for the volume 
and the lateral surface. 

11. We define an approximate relative error t m the differentiable function / by the 
formula e « df/f. Show that the approximate relative error of the product is equal to 
the sum of the approximate relative errors of the factors. Ihnt: e ^ d log/. 

4. Chain Rule. Let v = fix.y) be a function of the variables x and y 
which, in turn, are functions of some independent variable Jf ^ is given 
an increment A/, the functions x and y will acquire increments Ax and At/, 
and consequently u will receive an increment Au. 

Assuming that u = f(x,y) is continuous together with its partial deriva¬ 
tives, one can write [see (3-14)] 


du du 

Au ^ — Ax H-- Ay + €i Ax + (2 ^2/- 

dx dy 

Dividing both sides of this expression by At gives 

Au du Ax du Ay Ax Ay 

— —-- q--j„ . q_ — • 

At dz At dy At At At 


(4-1) 


Now if it is supposed that x and y can be difTerent-iated with respect to f, 
the expression (4-1) gives, upon passing to the limit as At —♦ 0, 

du du dx du dy 
dt dx dt dy dt 


df dx df dy 
dx dt dy (It 


since ei —» 0 and €2 —> 0. The reason for the vanishing of €1 and €2 

0 is as follow's. Since x and y are assumed to be differentiable func¬ 
tions of t, the identities 


Ax Ay 

Ax » - Atf xc ~ M 

At At 

show that Ax —> 0 and Aj^ —> 0 as A^ 0. But when Ax —► 0 and 
Ay 0 we know that €1 and <?2 approach zero by (3-16). 
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Formula (4-2) gives the rule for the differentiation of composite functions. 
It is clear that if u is a function of a set of variables, xxj a*2 ,..., where 
each variable is a differentiable function of an independent variable 
the derivative of u with respect to t is given by the chain rule: 


(4-3) 


du du dxi du dx 2 Ou dxn 

dt dxi dt dr2 dt d.r„ di 

A special case of formula (4-2) is of interest. If it is assumed that 


t = :r, (4-2) becomes 


dv 

dx 


du du du 

"" H-- • 

dx dy dx 


(4-4) 


Formula (4-4) can be used to calctilalo the derivative of the implicit 
function given by 

fU.y) - 0, (4-5) 

Let it be assumed that (4-5) can be solved tor y to yield a real solution 

y = (p(t); (4-6) 


then the substitution of (4-6) in the l(‘ft-han(i member of (4-5) gives an 
identity 


0 = /(x,^), where y = <^(x). 

Applying (4-4) to ft-7) gives 

df df dy 

0 = 

dx dy dx 

and solving for dy/dx^ 

dy df dx 

dx df dy 


(4-7) 


(4-8) 


The formula (4-8) assumes that df/dy dotvs not vanish for the point 
(xo,,Vo) at which the derivative is ealcMilated. 

Example 1. ljeif{x,y) « + x cos y « 0; then 

-- == 9x^y- + cos y, - x sm y 

dx dy 

dy 9x“y^ + cos y 

so that r- =* 

dx ()x^y - X sin y 

for all values of x and y that satisfy the equation 

-h X cos y « 0 

and for which i)x^y — x sin y 9^ 0. 

Example 2. Let « 0; then df/dx » 2x, df/dy » 2y. But it does not follow 

that 
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Tliis result is absurd inasmuch as the only real values of x and y that satisfy x® -f y* »« 0 
are x 0 and y « 0. Since df/dy vanishes at this point, the formal procedure used in 
obtaining dy/dx is meaningless. 

Example 3, Let /(x,j/) «= 0 represent the locus of a curve, and let P{xo,yo) be a point 
on the curve. The equation of the tangent line to the curve at the point P is 


y - yo 



(x - Xo). 

z—xo 


It follows from (4-8) that this equation can be written in the form 


fx(ro,yo)U - xo) +/y(xo,yo)( 3 / - yo) ** 0. 


PROBLEMS 


1 . Find the equation of the tangent line to the ellipse 

at the point (xo,yo). 


X y* 
a2 ^ 


1 


2 . Find the equation of the tangent line to the folium of Descartes 

^ ^ ^fixy «= 0 . 


Note particularly the behavior of the tangent line to the folium at (0,0). 

3. Find du/dt if 

|x « + c"* 


tan~ 


and 


4 . Find the equation of the tangent line to the ellipse 


X « « cos 0, 
y « 6 sm 0, 

at the point where B « x/4, 

6. (a) Find du/dl, if u « e' .sin yz and x y =* i — 1, z » 1 //; 

(6) find Ou/dr and du/dB, if a =» x“ — 4y^, x = r sec 6, and y ~ r tan 0. 

6 . (a) Find du/dx and du/dx, if « « x^ -f- i/ and y «= tan x; 

(6) given V = /(x,y,z), where x » r cos B,y =« r sin 0, z »» t; compute BVIdr, dV/dB, 
dV/dtm terms of dV /dx, dV I By ^ and BV/Bz. 


6. Differentiation of Composite and Implicit Functions. The reasoning 
employed in the preceding section can lie applied in obtaining the total 
differential, and hence the derivative, of a function of n variables 


where Xi =»= Xi{t), i =« 1, 2, ..n 

are n differentiable functions of a single variable L The resulting expression 
for the total differential is 
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A question arises concerning the validity of formula (5-1) in the case 
where the variables x* are functions of several independent variables 
i\,k, - Thus, let 

u ==/(xi,X2,...,x„) (5-2) 

be a function of the n variables x*, where the Xj are functions of the vari¬ 
ables ^i, t 2 y say 

X* = • - • j^m\ ^ == I, 2, .. > f Ti. (<5-3) 

If all the variables except one, say 4, are held fast, (5-2) becomes a function 
of the single variable 4 and one can calculate the derivative df/dfk with 
the aid of (5-1). The notation instead of df/dtk, is used to signify 

the fact that all variables except 4 are held fast 
Assuming the continuity of the derivatives involved, one can write 


df 

df dxx 

, df a/2 , 


df dXn 


XX —- 

d-- d“ • 


-- 

dl} 

Oxi 01 \ 

0x2 Oil 

OXn Oil 

df 

Of Oxi 

. Of 0X2 . 


Of OXn 


sra:-— 

d“ —-h • ' 



dh 

3xi 34 

0X2 0t2 

OXn 34 


Sf df dxi df 0x2 d/ dx„ 

dxi OU Otra dXn Otm 

If Of/Oil, 0f/0t2, 0f^0t,n are multiplied, respectively, by dti, 

..., dtm and the resulting expressions added, one obtains 


Oil 


+ 

+ 

+ 


df 





dl2 + • • ■ 

, 

d- dim 


a<2 


dl„ 


df 

/a/i 

Oxi 

3Xi 


( - rf/i 

+ - - d/2 d- • 

‘ * d- 

3xj 

\3/i 

3/2 

dU 

df 

/a/2 , 


a/2 


(- -rf/i 

d-d/2 d” * * 

• -i-( 

a/2 

\dii 

3/2 

dl„ 

df 

/dXn 

dXn , 

dx„ 


i — dh 

-|-- dt2 + • 


dXn 

\dti 

a<2 

dtr, 


dtn^ 


dtn 


The left-hand member of this expression is the total differential of /(xi^Xj, 
.. .,Xn), regarded as a function of the independent variables 4, 4, .. 
whereas the terms in the parentheses in the right-hand member are pre¬ 
cisely the total differentials of (5-3). Hence, one can write 

Of Of Of 

dtf = — dxi d- dx2 -f- ‘ + -— dxn, 

Oxi 3x2 OXn 
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which shows that formula (5-1) is valid whether the XtS are the independent 
variables or are functions of any set of other independent variables. 

The foregoing can be summarized as follows: 

ThEOKEM. If U ^ • -r^n), thm 

df df df 

du = — dxi H- dx 2 H-1- dxn, 

dXi dX2 dXn 

regardless of whether the variables Xi are the independent variables or are 
functioms of other independent variables ik. It is understood that all the 
derivatives involved {the df/dxk and dxjdtk) arc continuous functions. 

The fact that the total differential of a composite function has the same 
form irrespective of wliether the variables involved are independent or 
not permits one to use the same form\ilas for calculating differentials as 
those established for the functions of a single variable. Thus, 

d{u + v) — du + dvy 

d(ur) d{uv) 

d{uv) =- du H- dv 

du dv 


and so forth. 


V du u dvj 


Example 1, If u « j?/ + j/a + 2 x, a: y «*» e \ and z »» cos^, 

du , ^dz , .dy, dz 

^-(!/+*)- + (x+r)-- + (x+v)^ 

» («“* 4" cos 0(1) 4- (^ + cos0(~^“0 4- (^ + sinO 

e~'^ 4- cos t - — e~* cos i — isin I sin t. 

This example illustrates the fact that tliis method of computing du/dt is often shorter 
than the old method in which the values of r, y, and z in t/enns of t are substituted in 
the expression for u before the derivative is comiiuteci. 

Example 2. If f{z,y) ** 4- y^, where x =* r cos ip and y ** r sin yj, then 

df df dz df dy 9 . , 

— „-j-» 2x cos ^ 2y sin Ip ^ 2r cos^ ^ 4- 2r sin^ ip •» 2r. 

dr dz dr dy dr 


dip dx dip dy dip 


2x( ~r sin <p) 4" 2y(r cos ip) 


-2r^ cx)S yj sin ^ 4“ 2r^ cos Bin </> »* 0. 
or df *• 2x dx 4- 2y dy. 


Also, df «* 2r dr or df *• 2x dx + 2y dy. 

Since /(x,y) « x^ 4* 1 /^ * r®, these results could have been obtained directly. 
Example 3. Ijet z « c^, where i *“ log (u 4- v) and y « tan^*^ (u/v). Then, 


du u -tv 


% ^_ V 

du v^ 4" w* 



SEC. 5] 
Heiice, 


THE TECHNIQUE OP DIPPERENTIAlTON 


m 


Similarly, 


dz dz dz ^ dz dy ye^ xe^v 

du dx Su dy du w -f- V v* 4* w* 


dv u + t> 

The same results can he obtained by noting that 


dz ** dx -f xe^ dy. 

, dx , dr , 

dx *w — au d- -dv ’OS - 

du dv i 


I 1 

- du 4- - dv 

u 4- e w 4- e 


du dv V u 

and dj/« — du 4de « ,— -dv. 

du dv tP ^ u® 

du 4 dv . V du - // dv 

Hence, dz =* -;- \ ^ 

M 4 V" 4 ir 

/ xif^v \ / xe^'^u \ 

\M 4 4 wv \u 4 t» iP- 4 uP/ ^ 

j dz , dz , 

But dz osL - (in \ “ dr, 

du dv 

and since du and dv are independent (iifferentials, equating the coefficient's of du and 
dv in the two expressions foi dz givei? 

dz Xf^^v 

du u r^ 4 u^ 


dv u 4 e e* 4 fP 

Let f{x^y,z) = 0 define any one of the variables as an implicit function 
of the remaining ones. If x and y are thought to be tlie independent 
variables and one can obtain a rea^ differentiable solution for z in terms 
of X and y, it is possible to write 

dz dz 

dz — — dx ’i - dy, 

dx dy 

df df df 

But df - — dx 4- - dy + ~-dz 0. 

dx dy dz 

Substituting the value of dz in this equation gives 


■ dx H- dy + 


dx H- dy 


fdf dfdz' 

— 4-- 

Kdx dz dx, 


df (dz dz \ 

t! \dw & byf 
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Since x and y are independent variables, we get, on setting in turn dx 
and dy » 0, 


df df 3z 
—+ = 0 
dx dz dx 


0 


and 


df df dz 
dy dz dy 


= 0 . 


These equations could have been obtained directly by applying the chain 
rule to the equation f(x,y,z) = 0, in which z is regarded as a function of 
X and y, but we wished to illustrate another procedure followed in Sec. 10 
and elsewhere. If df/dz 0, these equations give 


dz _ df/dx dz _ df/dy 

dx df/dz dy df/dz 


(5-4) 


The formulas (5-4) permit one to calculate the partial derivatives of 
the function z defined implicitly by an equation 

S{jc,y,z) = 0. 

As an illustration, let 

x^ + 2/ - 3tz 4- 1 = 0. 

llien, by (5-4), 

dz 2x — 3z ^ dz 4y 

dx —3x dy —3x 


Example 4, A function.. .,Xn) of n variables j*i, xt, ..Xn is said to be homo- 
gmtoxtx of degree vi if the function is multiplied by X"* when the arguments xi, X 2 , .. 

Xn are replaced by Xxi, Xxa, ...» Xxn, respectively. For example, f{x,y) «• x^/Vx* -f 
is homogen^is of degree 1, becaase the substitution of Xx for x and Xy for y 3 rielci 8 
XxVVx'^ -f i/I Again, OJy) -h (logx ~ log y)/x is homogeneous of degree 

—1, whereas/(x,i/,z) ** -b is homogeneous of degree 

There is an important theorem, due to Euler, concerning homogeneous functions. 
Eulkr's Theorem. If u ^ /(Ji»X2, • • w homogeneous of degree m and has con- 
iinttous first partial derivatives, then 

df df df 

xi r- h X2~ -1-h — « m/(xi,X2,... ,atn). 

vJr2 


The proof of the theorem follows at once upon substituting 

x'l » Xxi, « XX2, ..., x« «« Xxn. 


Then, since/(xi,X 2 ,.. .,Xn) is homogeneous of degree m, 

fi^if^t • • • • • »®n)* 
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Difterantiating with respect to X gives 

a/ df df 

^ iii .*-)• 

If X is set equal to 1, then xi xi, ^ xi, ..JTn “* x^ and the theorem follows. 


PROBLEMS 


1. (a) Find dy/dx if ac sec y -f « 1; 

(6) find dz/dx and dz/dy if x^y — sin 2 -f z* * 0 

2. If / i^a f unct ion of u and t», where n « Vx*^ -h y^ and v - tan*^^ iv/x), find d//dx, 

af/dy, V(df/dxf '+Tdf/ay)l 

8. If / is a function of u and t>, where u — r cos a and i* * r sin a, find 


dr' dB* 



4. If X » x' cos a — y* sin $, y ^ x' sin a -h y' cos e, prove that 

6. Find the total differentia! if k -f y^, x = r cos a, and y « r sin a. 

6. If / « where x « log (u^ 4-1»^)^ and y « tan~^ (u/v), find df/du and df/dv. 

7 . If 2 * (a -f e)/(l ~ uv), u « y sin x, and v «= e*'', find dz/cix and dz/dy, 

8. Find dz/dr and dz/ds if 2 « (x — y)/(l *f xy), x « tan (r -- s), and y «« c***. 

9. Verify Euler’s theorem for each of the following functions: 


(a) f(x,y,2) 

(b) f{x,y) - 

(c) /(x,y) = 

(d) /(x,y,2) 


= x^y “h + 2 xyz\ 

y 


1 log X - log y 
y2 ■ X* ’ 




{€} f{x,y,z) 
(/) /(x,y) - 

(a) /■(a:,y) » 

ih) Kx,y) = 


« (x^ + y^ 'h 2*)“^; 

^Iv. 

Vx -f y ^ 

y 

-f y^ 


6. Higher Derivatives of Implicit Functions. The problem of calculating 
the derivative of y with respect to x when y is an implicit function of the 
independent variable x defined by 

f(x,y) ■« 0 (6-1) 

was discussed in Sec.4. It was shown there that 

dy 

fx{x,y) + fy{x,y) 7 “ == 0. (6-2) 

ax 

Differentiating this equation again and assuming that all the derivatives 
involved are continuous functions of x and y give 

dy /dy\^ d^y 

fxx(x,y) + 2f^„ix,y) ~ + fm{x,y) ^j + fy{x,y) — = 


0. (6-3) 
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If fit(x,y) 0 at the point where the derivative is desired, (6-3) can 
be solved for dJ‘y/dx^ and the value of dy/dx substituted from (6-2). The 
is 

A ^ fxxfl - 2S^JJy -h fyyfi 

dx^ fl 

The process can be continued to obtain the derivatives of higher orders. 

A similar procedure can be employed to calculate the partial derivatives 
of a function z of two independent variables x and y defined implicitly 
by an equation of the form 

f{x,y,z) = 0. (6-4) 

Differentiating (6-4) with respeot to x and y in turn gives 


fxi3-,v,z) + fz(x,y,z) 


dz 

dx 


= 0, 


fyi^yV/) +fz{3r,y,z) 


dz 


0 . 


(6-5) 


If fz(Xyy,z) does not vanish for those values of y, and z that satisfy 
(6-4), then Eqs. (6-5) can he solved for dz/dr and dz/dy. Partial deriva¬ 
tives of higher order can then be obtained by differentiating ecpiations 


Example IjcI it be required to find the derivatives of second order of the function z 
defined impliciitly by the equation 


yl 

-L ‘I- 4. . 

a2 ^ 62 ^2 


1. 


Differentiating this equation with respect to x and y gives 


2t 2r dz 


0, 


2y 2z dz 


0 . 


Differentiating the first of Eqs (6-6) with n'spoct to x and j/, one obtains 
2 2 /az\’ 2td‘^z ^ 


2 dz dz 2z dh 

-- ^ - - m 0. 

c* dx dy r dx dy 

Solving for dhld^ and dh/dx dy and making use of (fi-6), one obtains 

dh 


m) 
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In a mmilar way the differentiation of the second of Eqs. (6-6) with respect to y yields 

d^z -h c?y^ 

_ « ^ 


PROBLEMS 


1. Find y\ y'\ y”* if x* -f / — « 0. 

2. Find dz/dx, d^z/dx^, dhjdx dy, and dhfdy^ at (1,1,1) if a:* — -f- »« 1. 

3. Find dz/dx, if 

(a) xz^ — 1/2* -f xyh — 5 « 0; (6) arz* - i/z -h 3^?/ « 0. 


7. Change of Variables. The main purpose of this section is to develop 
manipulative skill in calculating the derivatives of imidicit functions and 
to indicate the formal modes of attack on the problem. The continuity 
of the functions and their partial derivatives is assumed throughout this 
section and will not be referred to again. 

w = /(w,r) (7-1) 


denote a function of two independent variables u and and suppose that 
u and V are connected with some other variables x and y by means of the 
relations , . 

X = x(n,v)y 


y = y{u,v). 


(7-2) 


If Eqs. (7-2) are solved for x and y to yield 

u = u{x,y), 

V = v{x,y), 


(7-3) 


and the expressions (7-3) are substituted for u and v in (7-1), there will 
result a function of x and say 

w == F{x,y). (7-4) 


The partial derivatives of w with respect to x and y can be calculated 
from (7-4) directly, but frequently it is impracticable to obtain the solu¬ 
tion (7-3), and we consider an indirect mode of calculation. By the rule 
for the differentiation of composite functions, 


dw 

du 

dw 

dv 


div dx dw dy 

dx du dy du 

dw dx dw dy 

dx dv dy dv 


(7-5) 


The partial derivatives dx/du, dy/du, dx/dv, and dy/dv can be calculated 
from (7-2), and hence they may be regarded as known functions of u and 
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V, The partial derivatives in the left-hand members of (7-5) are also 
known functions of u and v, since they can be calculated from (7-1). 

Hence, equations (7-5) may be regarded as linear equations for the 
determination of dw/dx and dw/dy. Assuming that the Jacobian J{UjV) 
defined by 

\dx I 


JM s 


du du 
dx dy 


dv dv 


is not zero and solving by Cramer^s rule give 



dw dy 


dx dw 


du du 


du du 


dw dy 


dx dw 

dw 

dv dv 

dw 

1 dv dv 

dx 

J(u,v) 


J iu,v) 


The resulting expressions for dw/dx and dw/dy are known functions of 
u and V and thus can be treated exactly like (7-1) if it is desirable to cal¬ 
culate the derivatives of higher orders. 

As an example, consider the function w{rfi), and let it be required to 
calculate the partial derivatives of w with respect to x and y, where x « 
r cos 6 and y — rain B, Now 


dw 

dw 

dx 


dw 

dy 

dw 

dw 


- s 

E - 

— 

+ 

— 


= — cos B + 

— Bin d. 

dr 

dx 

dr 

dy 

dr ^ 

dx 

% 


dw 

dw 

dx 


dw 

dy 

dw 


dw 

— = — 

— 

+ 

— 


=-rsin 

0 + 

— r 

dB 

dx 

d^ 


dy 

dB ^ 

dx 

dy 


Solving these equations for dw/dx and dw/dy in terms of dw/dr and dw/dB 
gives 


dw dw sin B dw 

— = cos B -—» 

dx dr r dB 

dw dw cos 6 dw 

— ^ Bin 0 -1- 

dy dr r dB 

The Jacobian J is, in this case, 

cos B sin B 
—rsin^ rcos0 



which does not vanish unless r »» 0. 
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As a somewhat more complicated instance of implicit differentiation, 
consider a pair of equations 


F{x,y,u,v) =« 0, 
G{x,y,u,v) == 0, 


(7-6) 


and let it be supposed that they can be solved for u and v in terms of x 
and y to yield 

u = u(x,y), 

(7-7) 

V = v{x,y). 


The partial derivatives of u and v with respect to x and y can be obtained 
in the following manner. Considering x and y as (he independent \ariables 
and differentiating Eqs. (7-6) with respect to r and y give 


dF 


dF du dF dv 


dF 

dF du 

dF dv 

— 

+ 

-- -- 

= 0, 

- + 

-1_ 

-= 0, 

dx 


dv dx dv dx 

dy 

du dy 

dv dy 

dG 


dG du dG dv 


dG 

dG du 

dG dv 

— 

+ 

--- 

= 0, 

— + 

- h 

— — = 0. 

dx 

du dx dv dx 

dy 

du dy 

dv dy 


Equations (7-8) are linear in du/dx, du/dy, dv/dx, and dv/dij. If 


J{%v) - 


dF 

dF 

du 

dv 

dG 

dG 

du 

dv 


the partial derivatives in question can be determined from (7-8) by 
Cramer’s rule. 

A special case of Eqs. (7-6) is useful in applications. Let 

X = /(w,t’), 
y = giu,v). 

Differentiating these equations with respect to x and remembering that 
X and y are independent variables, one obtains 

df du df dv 

1 « —— +- 

du dx dv dx 

dg du dg dv 

du dx dv dx 


(7-9) 
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Example 1. Let 




du 

dv 



du 

dv 


u* -* v® -f“ 2® » 0, 

M*> — y « 0. 


Differentiating with respect to 

/ dv \ 

du dv 

ff -j- — » 0. 

dx dx 

^ du u dv V 

Hence — «-s-5. -» -r-- 

dx -f dx -f tr^ 

Differentiating the first of these results with respect to x gives 

dw , . . / du dv\ 


4 -1^) + 2 -f V u 


da^ (u^ -f 

u(u^ 4 — 2u{u^ — tr) u(Zv^ — w^) 

“ ““ ru^'TvY^ 

One obtains similarly dh/dx^^ d^u/dx dy, and higher derivatives. 
Example 2. Let 

r •« a 4 
(a) 

.V ** 3u 4 2v. 

Differentiating with respect to x, 

du dv 

1 * r- 4 


,v ^ du dv 
0 « 3 — 4 2-~r 
dx dx 


SO that 

It is easily checked that 


dv 

-- « 3 . 
dx 


Equations (a) can be solved for w and v in terms of x and y, and the result is 


u «* ~-2x 4 Vt 
V ^ ^x - y. 
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Eeg&rding u and v as the independent variables and differentiating these equations with 
respect to w, one finds 


1 


du du 


0 » 3 


dx 

du 


?1, 

du 


Hence, 


ax 

du 


1, 


du 


« 3. 


'i'his agrees with the result obtained by direct differentiation of (a), as of course it should. 
Note that du/dz and dx/du are not reciprocals. 

Example 3. If w « uv and 

-b V ~f* r =*= 0, 

(^) 

— u — p =« 0, 

one can obtain dw/dx as follows; DiffiTentiation of w with respect to x gives 

dw dv du 
— « u ~ 4- i» — 
dx dx dx 


Th(‘ valuer of du/dx and <H!/dx can be calculated from (h) as w^as done in Example 1. 
riie readi*! will check that 

dw u 4" dw 2u^ — v 

dx 14“ ^uv dy 1 4* 4wt» 


PROBLEMS 

1, If u* 4- 4- — 2x «« 0, 4- e® — x^ 4- 3y =* 0, find dujdx, dv/dx, du/dy, and 

dv/dy. 

2. Find dw/dx and dw/dy if w =« u/v. 


and 


j t* u 4- 
y * 3u 4“ 2v. 


3. »Show that if f(x,y,z) « 0, i\m\{dz/dx)(dz/dz) » 1 D.nd{dx/dy)(dy/dz){dz/dz) «= —1. 
NoU* that in general dz/dx and dx/dz are not reciprocals. 

4 . If j a* x{u,v), y »« y{u,v) with dx/du =» dy/di\ and dx/dv *= —dy/du, then 

+ !!n r f-Vl • 

du* dv* \dx* dy*/ L Vdu/ \dv/ J 


3. Show that the expressions 





and 


V2 


dh dH 

a? 


Upon change of variable by means of x 


r cos d, 1 / "• r sin d, become 
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and 

6. Show that 




V2 



dh 1 d^z I dz 
ir* r* "** r ir 


ax^ 


if V ^ fix -h a) + gii 
second derivatives. 

7. Show that 


rO, where / and g arc any functions possessing continuous 


dx“ dy^ \ dr^ / 


if X » e’' cos B, y ^ sin 0. 
8. Find Bu/dx if 


9. Prove that 


^ t ,2 _ JJ.3 3y 3 . 0 ^ 

M -f c — y/2 — 2x « 0. 


dti dy dv By 
dx du dx Bv 


if F(x,|/,u,c) « 0 and G{x,y,v,v) « 0. 

10. If Vi{x,y,z) and V 2 {x,y,z) satisfy the equation 


V'V 


B~V 


0 , 


then 


U » ri(x,y,z) + + !/' + z'‘)Vi{x.y,z) 


satisfies the equation 


where 


V2 r « 0, 


V2 


d" 

dx^ By^ Bz^ 


11, To indicate explicitly the variables entering in the Jacobian 


Jiu,v) 


dx By 
du du 
dx By 
dv dv 


one frequently writes J(u^v) 



The Jacobian 




du 

dv 

Jx 

dx 

du 

dv 

dy 

~^y 


of the transformation (7-3) is written as /(x,y) »■ J (™ j > 

\x,y/ 


Prove that: 
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where u « u(x,yl v « v(x,y), x « x(^,»?), atid y « y(^,fj). Hint: Write out the Jacobians 
and multiply. 

APPLICATIONS OP DIFFERENTIATION 

8. Directional Derivatives. Formula (4-2) has a simple geometrical 
meaning wlien interpreted as the space rate of change of a given function 
u(x^y). Thus, let u(x,y) be specitied along a smooth curve C with para¬ 
metric equations _ 

X — x(s)f 

(8-1) 

y - y(s)^ 

where $ is the arc-parameter measured along C. By virtue of Eqs. (8-1), 
'u(x,y) can l)e regarded as a function of s and the rate of change of u(x,y) 
along C is 

dv du dx du dy 

~ - +- ( 8 - 2 ) 

ds dx ds dy ds 

At a given point Po(xoryo) on C, Kq. (8-2) yields 


^x(xo,2/o) (*os a + Uy(xodJo) sin a, 


since dx/ds = cos a and dy/ds »= sin a, as is clear from Fig, 5. It follows 
from (8-8) that the rale of change of 
u(x,y) at a given point depends only on 
the direction of the curve pas.sing thro\igh 
that point. If tlie direction of C is tiiat 
of the x axis, the angle a ~ 0 and du/ds 
= du/dx; if the direction of C is that of 
the y axis, a == ir/2 and du/ds = du/dy. 

For an arbitrary direction specified by a, 

Eq. (8-3) defines the directional derivative 
of u(x,y) in that direction. Thus the de¬ 
rivatives Ux and Uy axe directional deriva¬ 
tives in the directions of the coordinate Fm. 5 

axes indicated by the subscripts. 

We now ask the question: What is the angle a for which the directional 
derivative of w(x,y) at a given point has a maximum value? Since a 
necessary condition for a maximum is the vanisliing of the derivative of 
(8-3) with respect to a, we get the equation 
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—«»(* 0 )!/o) sin a + Ug(xo,yo) cos a » 0, 
from which wo conclude that when Ut{xo,yo) 3^ 0, 

^vixo.yo) 


tan a 


'^xixo.yo) 


(8-4) 


Accordingly, there are two values of a differing by 180° which satisfy the 
condition (8-4). The corresponding values of cos a and sin a in (8-3), 
therefore, are 

rfcrt/x . 


COS a = 


^r~ 2* , 2 

V 4- 


sin a = 


Vul + vl 


(8-5) 


The substitution in (8-3) of the values from (8-5) with the plus sign yields 
the desired maximum 


CA -VZ: 

\di>/ niftx 


* -F uh 


(8-6) 


while the other pair of values in (8-5) gives a minimum 
( 7 ^) =-Vw2+u2. 

\dh/ 

The vector pointing in the dire(‘tion of the greatest rate of increa^^e of 
u{x^y) at a given point (x,y) and whose length is determined by (8-(>) is 


/du\ 

called the gradient, and i — j 

\d^ / max 

the normal derivative by du/dn and write 

da 


is called the normal derivative} We denote 


dn 


= + nl • 


(8^7) 


A similar discussion can be applied to a differentiable function u(x,y,z) 
defined along a space curve C with parametric equations 

X - .rf,s), 


y ^ 

z = z(s). 

We get 

du dudx dudy dudz 

ds dx ds dy ds dz ds 

^ The reason for this terminology is given in Chap. 5, Sec. 3. 


( 8 - 8 ) 


(8-9) 
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where (see Fig. 6) 

dx dy ^ ^ dz 

— «= cos (x,a), -j- =* cos = cos ( 2 ,S) 

as ds ds 


( 8 - 10 ) 


are the direction cosines of the tangent line T at a given point P of C. 

To determine the particular direction yielding a maximum of (8-9) at a 
given point P(To,yo,2:o), we must maximize the resulting function of the 



direction cosines in (8-9). Tiiis problem, involving the determination of 
a maximum of functions of several variables, is discussed jn Examp^^ 3, 

/du\ 

Sec. 10, where it is shown that I —) for (8-9) is given by the formula 

Vd.Vmax 

du ,—- 

— = V Wi + wj + wj , (8-11) 

dn 

analogous to (8-7). The expression (8-11) is called the normal derivative 
of u. 

Example 1. Find the directional derivative for u{x,y) « -h at (1,1) in the direc¬ 
tion making the angle of 30® with the positive x axis. 

Formula (8-3) yields 

— 1 - 2i I cos 30“ + 2y\ sm 30“ - VS + 1. 

dn 1(1,1) 1(1,1) 1(1,1) 

The normal derivative at this point, as found with the aid of formula (8-7), is 

^ + (2j/)* I “ 2V2 

an 1(1,1) 

and the corresponding angle «, as follows from (8-4), is 45®. 

Example 2. Find the directional derivative of xi{Xyy,z) * xyz at (1,2,3) in the direc¬ 
tion of the line making equal angles with the coordinate axes. Since the angles are 
equal and the sum of the squares of the direction cosines is 1, we conclude from 

cos® (x.«) -I- cos® {y,8) -F cos® («,«) «* 1 
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that cos (XfS) cos (y,s) » cos (*,«) * l/\/S. Also, at the point (1,2,3) we have 


du ^ du 

T- * 2/2 *" 6, — 

dx dy 

The substitution in (8^9) then yields 
du 
d$ 


« X 2 *» 3, 


du 

Jz 


« xj/ *• 2- 


a/ 3 VS *“ Vs’ 


Example 3. Show that the directional derivative of u(x,y) in two noncoUinear direc¬ 
tions determines the derivative in all directions. 

Let the derivative be given for directions ao and on, so that 

Ux cos ofo “h sin oso * o, 

u, cos ai -f Uy sin ai as b, 

where a and h are known. If these are regarded as equations for the unknowns u* and u,^ 
the coefficient determinant is 


I cos ao sin ao I 

. a® cos ao sm ai — cos oti Sin ao* 

I COS ai sm aj I 

This reduces to sin (ai — ao), which is zero only if the two directions are collinear. Hence 
Ux and Uj, can be found, and the directional derivative is determined for every direction 
by (8-3). 


PROBLEMS 

1. Find the directional derivative of f(x,y) «= x^y -f sin xy at (l,ir/2), in the direction 
of the line making an angle of 45® with the positive x axis. 

2. Find 

if X » r cos $, y ^ r sin 6, and / is a function of the variablas r and (?. 

3. Find the directional derivative of fix,y) «= x^ij -j- c*'* in the direction of the curve 
which, at the |X)int (1,1), makes an angle of 30® with the x axis. 

4. Find the normal derivative of u «* x^ -f -f at Ihe point (1,2,3) and the direc¬ 
tional derivatives at that point along the line joining (0,0,0) and (1,2,3). 

9. Maxima and Minima of Functions of Several Variables. A function 
defined in a region R is said to have a relative maximum at a point 

(a,b) If ^f(^a + h,b + k) - f{a,b) < 0 (9-1) 

for all values of h and k in the neighborhood of (a,b). It is said to have a 
relative minimum at {a,h) if 

A/ ^ J{a + h,h + k) — f(a,h) > 0 (9-2) 

for all values {h,k) in the neighborhood of (a,6). 

The requirement that the inequalities (9-1) and (9-2) hold for all values 
{h,k) in the neighborhood of (a,6) implies that we are concerned here 
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only with the interior and not the boundary points of the region. A func¬ 
tion may attain a maximum or a minimum value on the boundary of the 
region, but the behavior of functions on the boundary requires a separate 
investigation, the nature of which will be clear from the sequel. The 
greatest and least values assumed by f{x^y) in the closed region are called, 
respectively, the absolute maximum and the absolute minimum. In the 
following discussion we dispense with the adjective ^^relative,^' and we 
shall refer to relative maxima and minima simply as maxima and minima. 

Let it be assumed that f{x,y) attains a maximum (or minimum) at some 
interior point {a,b). Then the func¬ 
tion f(x^b) of the variable x must 
attain a maximum (or minimum) at 
X — a. From the study of functions 
of one variable it follows that the 
derivative of f(x,b), if it exisf,s, must 
vanish at x — a. The derivative may 
cease to exist at the critical points 
when the behavior of the function is 
like that shown in Fig 7 in the neigh¬ 
borhood of X = 0 ], .r = 02 , and x — 03 . 
a maximum (or minimum) of f{x,b) at x 



Thus, a necessary condition for 
= a is that 


dx 


0 


(9-3) 


if this derivative exists at x — a. 

A similar consideration of the function f{a,ij) leads to the conclusion 
that 

d/ 

— == 0 Sit y = b ( 9 - 4 ) 

whenever this derivative exists. 

The coordinates (a, 6 ) thus satisfy the pair of equations 





(9-5) 


at any point {a,b) where/(x, 2 /) attains a maximum or minimum. 

This discussion is capable of extension to functions of any number of 
variables to yield a theorem. 

Theorem. A function /(xi,X 2 ,... ,Xn) of n independent variables attains 
a maximum or a minimum (mly for those values of the variables Xifor which 
/xp •. *, /x„ either vanish simultaneously or cease to exist. 

We emphasize that the conditions stated in this theorem are necessary 
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but not sufficient for a maximum or a minimum.^ Althougli tlie matter of 
sufficiency can usually be determined from the nature of the problem and 
from physical considerations leading to its formulation, w© record here a 
test that may prove useful to settle doubtful cases.^ 

If /(^,?/) is a function with continuous second partial derivatives, and if 
/»(u,h) «= 0 and fyia,h) — 0, then /(a,6) is a maximum provided that 

D = fly(a,b) - fTx(a,h)fyy{<l,b) < 0 

and /rx(u,^>) < 0, fyy{a,b) < 0; it is a ymnimvm if Z) < 0 and/rx(a,f>) > 0, 
> 0; it is neither maximum nor minimum (a saddle point) if 
1) > 0. This test gives no information if D = 0, just ns the condition 
f"{a) - 0 gives no information for the function f{x) with /'(a) = 0 . 

Before proceeding to a further study of maxima and minima, we give 
two examples illustrating the developments of this section. 


Example 1. A long piwo of tin 12 m 



wide 18 made into a trough by bending up 
th(' sides to form equal anglos with the ba8<‘ 
(Fig. 8). ]''ind the amount to lx* bent up 
and the angit' of inclination of the sido.s tliat 
Will make tlu' carrying cai)acitv a maximum. 

1'h(‘ voluiiu' will b(' a maximum if the 
area of tlie 1 rap(‘zoidal ctoSkS section is a 
ma.ximuni d'ho area is 

A » J2r sin 0 — 2j“ sin i? sin 0 cos 


for 12 — 2x is the low'er base, 12 — 2j: + 2j cos 0 i& the upjier base, and x sin 0 is the 
altitude. Then, 

— « 12x cos 0 — 2j“ cos /9 -f ros^^ fl — sirr 0 
d0 

= x(l2 cos 6 — 2x cos 0 A- x cos'’ 0 ~ x sin^ 0) 

. d.4 . , 

and -— ~ 2 sm 0(G — 2x A- x cos 0). 

dx 


Now dA/dx ^ 0 and dAfSO 0 if sin ~ 0 and x « 0, which, from phy.sic.al considera¬ 
tions, cannot give a maximum. 

There remain to be satisfied 

6 ~ 2x d- X cos ~ 0 


and 


12 cos 0 ~ 2x cos 0 X cos^ 0 — x sin^ 0 ~ 0 


Solving the first equation for x and substituting in tlie second 3U(*ld, upon simplification, 

cos 0^1^ or 6 ^ 60”, and x = 4. 

Since physical considerations show that a maximum exists, x «= 4 and 0 « 60® must 
give the maximum. 


* Recall, for example, the situation when/(x) has a point of inflection with a horizontal 
taiigent, 

^ * A proof and further discussion are contained in 1. S. SokolnikofT, “Advanced Calcu* 

sec, 89, McGraw-Hill Book Company, Inc., New York, 1939. 
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ExampU 2. Find the maxima and minima of the surface 


« 2c». 


Now, 

which vanish when x ^ y 
dx^ * 




di lx 
dx c o® 

« 0. But 

-L ^ 

dy^ 


dy 


Wc 


l^ 

c6*’ 


dx dy 


0 . 


Hence, D « l/a*6V, and consequently, there is no maximum or minimum at a: =* y * 0. 
The surface under consideration is a saddle^sliaped surface called a hyperbolic paraholoid. 
The points for which the first partial derivatives vanish and /> > 0 are caDed minimax. 
The reason for this odd name appears from a consideration of the shape of the hyperbolic 
{laraboloid near the origin of the coordinate system. The reader will benefit from sketch¬ 
ing it in the vicinity of (0,0,0). 


PROBLEMS 


1. Divide a into three* parts such that their product is a maximum. Test by using 
the secon< 1-derivative criterion. 

2. j'lnd the volume of the largest rectangular parallelepiped that can be inscribed in 
the ellipsoid 


a* ^ ^ c* 


= 1 , 


3. Find the dimensions of the largest rectangular parallelepiped that has three faces 
in the coordinate planes and one vcrt<»x in the plane 


X V z 

" 7 + “ 

a 6 c 


1 . 


4. A pentagonal frame is composed of a rectangle surmounted by an isosceles triangle. 
What are the dimt'nsions for maximum area of the fientagon if the perimeter is given as P? 

5. A floating anchorage is designed with a body in the form of a right-circular cylinder 
with equal ends that are right-circular cones. If the volume is given, find the dimensions 
giving the iniiiimum surface area. 

6. Given n points P^ whose coordinates are {x%,yi,Zt) {i 1,2, ..., n). Show that the 
coordinates of the point P(x,yjZ), such that the sum of the squares of the distances from 
P to the P* is a minimum, are given by 



10. Constrained Maxima and Minima. The discussion in the preceding 
section was confined to the calculation of the maximum and mi niTpum 
values of functions of several independent variables. In a large number 
of investigations, it is required that the maximum and minimum values 
of a diflerentiable function /(xi,a; 2 >.. be found when the variables Xi 
are connected by some functional relationships, so that the Xi are no longer 
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independent. Such problems are called problems in constrained maxima 
to distinguish them from the problems in Jree maxima discussed in Sec, 9. 

To avoid circumlocution, we shall speak of the maximum or minimum 
values as the extreme values. Thus, let us consider the problem of finding 
the extreme values of the function 

w (10-1) 

in which the variables ar, z are constrained by the relation 

== 0 . ( 10 - 2 ) 

This problem can be solved by the procedure of Sec. 9 as follows: Suppose 
that the constraining relation (10-2) is solved for one of the variables, 
say s, to yield a differentiable function 

z =- (10-3) 

If one substitutes z from (10-3) in (10-1), there results the function 

V = /[x.yM^yV)] ^ (10-4) 

of two independent variables x, y to which the considerations of »Sec. 9 
apply. 

However, either the solution (10-3) may be difficnilt to obtain or the 
function F(x,y) in (10-4) may be so unwieldy that the simultaneous (‘cpia- 
tions Fx(x,y) ~ 0, Fy{x,y) = 0 are unpleasant to deal with. In this 
event an ingenious method devised by the great Frein^li analyst Lagrange 
often leads to a manageable and symmetric system of equations for the 
determination of extreme values. The central idea of the method hinges 
on the following observation. In Sec. 9, we saw that a nec'essary condition 
for a relative extremum of tlie differentiable function/(.ri,j: 2 ,... ,Xn) of 
n independent variables is the simultaneous vanishing of all partial deriva¬ 
tives/x,. Inasmuch as the total differential of / i.s 

== fxi dxi + /x, t/X2 H-h /x„ dxnf 

it is clear that df - 0 whenever each /x^ ^ 0. Conversely, if df = 0, the 
partial derivatives /x, vanish, since the dx^ are independent. But it is 
also true that the vanishing of the total ilifferential is a necessary condi¬ 
tion for an extremum of /(xi,T 2 ,. . even when the variables x^ are 
dependent because of the invariant character of df stated in the theorem 
of Sec. 5. We can thus state a theorem: 

Theorem. A necessary condition for an extremum of a differentiable func¬ 
tion f(xxjX%, ... jXn) i^ the vanishing of its total differential at the maximum and 
minimum points of the function. 

We proceed now to a discussion of the method of Lagrange multipliers 
for determining the extreme values of the function in (lO-I) subject to 
the equation of constraint (10-2). 
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By the theorem just stated, the differential of (10-1) vanishes at the 
critical points so that 

df df df 

^dx + —dy + -^(k=^0, (10-5) 

dx dy dz 

Also, since ^(x,^/, 2 ) «= 0, its total differential vanishes and we can write 


d<f> dtp dtp 

— H- dy - dz = 0 . ( 10 - 6 ) 

dx dy dz 

Let Eq. (10-G) be multiplied by some parameter X and then added to 
(10-5). The result is 



(10-7) 


If we regard x and y as independent variables, and sup])ose that d<p/dz 9 ^ 0 
at the point where the extremum is attained, then we can find a X such 
that at this point 


df dtp 

- + X — 
dz dz 


= 0. 


( 10 - 8 ) 


With this choice of X, Eq. (10-7) reduces to 


/ d/ dtp\ / df dtp\ 

(- hX~— ) dx I -hX — j dy = 0. 

\dx dx/ \dy dy/ 


But since dx and dy arc independent increments, we conclude from this 
equation that 

df dtp 

~ + X—= 0, 
dx dx 


dy 


dtp 

+ X -- - 0. 
dy 


(10-9) 


The system of three equations (10-8) and (10-9) contains four unknowns 
T, ?/, X, and we must adjoin to it the fourth equation (10-2) to obtain the 
complete system for the determination of the unknowns 
If dtp/dz = 0 at the point where the extremum is attained, but dtp/dy 9 ^ 0, 
the roles of z and y in the foregoing discussion are interchanged. Clearly, 
the method will fail to yield the desired value of X when tp^^ tpy, and tp^ 
vanish simultaneously at the point where f{x,y,z) has an extremum. 

Before proceeding to extend the Lagrange method to the study of 
extreme values of functions with several constraining conditions, we con¬ 
sider four instructive examples. 
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Emfftple 1. Find the maximum and the minimum distances from the origin to the 
curve 

5x2 ^ 4- -- 8 ^ 0. 

The problem here is to determine the extreme values of 

f(x,y) « x2 4- 

subject to the condition 

<p(x,y) 3s 5x2 4- 4- 52/2 -- 8 =s 0. 

Equations (10-9) and (10-2) in tliis case read 

2x 4- X(l0x 4* 6y) « 0, 

2y 4- X(6x 4- 10^) - 0, 

Sx® 4“ 6xj/ 4- 57/2 — 8 =* 0. 

Multipl 3 dng the first of these equations by y and the se<;ond by x and then subtracting 
give 

6X(|/2 — x2) «• 0, 

80 that y « 4:x. Substituting these values of y in the third equation gives two equations 
for the determination of x, namely, 

2x2 ^ 3.2 ^ 2. 

The first of these gives/ m x^ ^ I, and the second giv(?a/ ss x2 4* ?/* =« 4. Obvi¬ 
ously, the first value is a mininium, whereas the s<wond is a maximum. The curve iw an 
ellipse of semiaxes 2 and 1 whost* major axis makes an angle of 45° with the x axis. 

Example 2. Find the dimensions of the rectangular box, without a top, of maximum 
cajmeity whose surface is 108 in.2 
The function to be maximized is 

/(x,y,r) K xj/2, 

subject to the condition 

xy 4- 2xz 4- 2yz «= 108. (10-10) 

Equations (10-8) and (10-9) yield 

y^ -4- My 4- 2z) « 0, 

X2 4” x(x 4 2z) =* 0, (10-11) 

xy 4- X(2x 4- 2y) »» 0. 

In order to solve these equations, multiply the first by x, the second by 1/, and the last 
by z, and add. There results 

M2xy 4- 4x2 4- 4yz) 4- Zxyz « 0, 

or X(xy 4- 2xz -f 2yz) 4" H^yz *» 0. 

Substituting from (10-10) gives 

108X -f Hxyx ** 0, 


or 


X 


xyz 

72 
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Substituting this value of X in (10~ll) and dividing out common factors give 
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1 -~(«+ 2 *) - 0 , 

I +2*) -0, 

1 _^(2x4.2j,) -0. 

From thp first two of these equations, it is evident that x » j/. The substitution of x « j/ 
in the third equation gives z « IS/y. Substituting for y and z in the first equation yields 
X = r>. Thus, X 0, ?/ =» 6, and 2 *= 3 give the desired dimensions. 

Example 3. Show that the maximum value of the directional derivative of u(x,y, 2 ) 
at any point is given by 

We write the directional derivative (see Eq. (8-9)] in the form 

du 

f{ot,^,y) j- *» w, cos a -f cos /3 -f Ua cos 7 , ( 10 - 12 ) 


where cos a « cos (j,s), cos cos (y,«), cos y « cos (z,«), and maximize f{a,^^y) sub¬ 
ject to the constraining condition 

^(«,/3,7) ^ cos* a d- cos* /3 + cos* 7 — 1 « 0 . (10-13) 

The system of F.(i 8 (10-8) and (10-9) then yields 


— Ux sin a - 2 X cos a sin a « 0 , 

—sin ^ — 2X cos sin /3 * 0, (10-14) 

—u* sin 7 — 2 X cos 7 sm 7 « 0 . 

The case when either sin a, sin jS, or sin 7 vanishes is trivial because of the constraining 

condition (10-13). Thus, the system (10-14) reduces to 

Ux *» 2X cos a, Ui, ^ 2X cos /S, Ut « 2X cos 7 , (10-15) 

and we conclude that 


wj 4- 4 -f Uj 


4X*. 


Thus, X « -h u\ -f Ug, and the substitution of this value of X in (10-15) gives 


cos 0 


Vu| -f a* 4- 4- 4- wf 

On inserting these values in ( 10 - 12 ) we get the desired result 


cos 7 


Ug 


Vu'i 4” wj 4“ uj 
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Examph 4. Find the shortest distance from the origin to the curve y 
in Fig. 9. We apply the procedure employed in Example 1 to minimize 

f{x,y) * I® 4- j/® 

subject to the constraining condition 

4p(x,y) m 7 / - (a; ~ 1)^ « 0. 
Equations (i0»9) now yield 

2x ~ 3X(x - 1)^ - 0, 

2y 4- 2Xy * 0, 


[chap. 3 

« (X - 1 )^ 
(10-16) 

(10-17) 

(10-18) 


which must be solved together with (10-17). The 
system (10-17) and (10-18) has no solutions fora;, y, 
and X 'Phis becomes obvioas on noting that the mmi- 
mum is atbiined at j" « 1 , 7/ 0 , and if we insert 

these values in (10-18), the first of the resulting equa¬ 
tions yields a nonsensical result 2=0 whil(‘ the second 
is true for all values of X. The reason that the La¬ 
grange method this time has failed to give the solution 
is simple. The method depimds on the assumption 
that not both <px and <py vanish at the point where 
the extremum is attained. In our case ^i(l,0) =» 0 
and v 9 y(l| 0 ) = 0 . The moral of tins example is that the Lagrange method yield.s the 
solution of the problem only when the system of Rqs. ( 10 - 8 ) and (10-9) can be solved 
for X. 



PROBLEMS 

1. Work Probs. 1, 2, and 3, Sec. 9, by using Lagrangian multipliers. 

2. Prove that the point of intersection of the medians of a triangle possesses the prop¬ 
erty that the sum of the squares of its distances from tlie vertices is a minimum. 

3. Find the maximum and the rainimura of the .sum of th(j angles made by a line from 
the origin with (a) the cooidiiiate axes of a cartt*sian system, (h) the coordinate planes 

4. Find the maximum distance from the origin to the folium of Descartes x^ ■i' ~ 

Zaxy «*• 0 . 

6 . Find the shortest distance from the origin to the plane 

ax by cz ^ d. 

11. Lagrange Multipliers. We now extend the considerations of Sec. 10 
to cases where the extremum of the function /(xi,T 2 , ... ,Tn) is sought under 
several conditions of constraint. 

We consider first the function 

W ^ f(x,y,u,v), ( 11 - 1 ) 

in which the variables are constrained by two relations 

,pi{x,y,u,v) = 0 , 

<P 2 (x:,y,u,v) = 0 . 


(11-2) 
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If IV takes on the extreme values for certain values of then for 

such values 

df df df df 

— cte H- dy -(fu H- dv = 0, (11-3) 

dx dy dudv 

by the theorem in the preceding section. Also, (11--2) yields two equations; 

— dx H- dy d- du H- dv = Q 

dx dy du dv 


di(>2 d<P2 d<p2 d(p2 

- dx d" -— dy “f" —^ du d* — dv = 0. 

dx dy du dv 


(IM) 


We multiply the first of these by Xi and the second by X 2 , add the results 
to (11-3), and obtain 


Now, if 


d^2\ 



dtpi 

dt(>2\ 

d- X 2 -) 

dx + [ — 

4-X 

-L \-1 

dx/ 

\Sy 


dy 

dy/ 


/df 

d‘p\ 

d(p2\ _ 

d- 

(— + Xi 


d* X 2 ' 

— dv 


\du 

du 


du/ 



/df 

d<pi d<p2\ _ 


+ 

- + X, - 

— + Xo — dv 



\dv 

dv dv / 




a<pi 




du 

dl> 



J(u,v) = 



9 ^ 0, 



dip2 

1 

I 




du 

dv \ 



0. (11-5) 


the values of Xi and X 2 can be found such that 
df d(pi d(p2 

i + x, - + x^— = 0, 

du du du 

df d<pi d^p2 

™ + X, + X 2 —- = 0, 
dv dv dv 


( 11 - 6 ) 


and accordingly (11-5) reduces to the sum of two terms involving arbitrary 
differentials dx and dy. The fact that they are arbitrary enables us to 
conclude that 

d/ , ^ d<pi d<p2 
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The system of six equations (11-6), (11-7), and (11-2) serves to determine 
the parameters Xi, X 2 and the point {x^y^u^v) at which the extreme is at¬ 
tained. 

The foregoing procedure may be extended to cover the case of more 
than two constraining conditions and we obtain the following rule: 

Rule. In order to determine the extreme valtces of a function 


/(X1,X2,...,X„) 

whose variables are subjected to m constraining relations 

(11-8) 

• «,^n) =0, i = 1, 2, ..., m, 

form the function 

m 

»=»! 

(11-9) 

and determine the parameters X,- and the values of Xi, X2, ... 
equations 

dF 

, x„ from the n 

— i=l, 2, 

dxj 

(11-10) 


and the m equations (11-9). 

It should be carefully noted that the applicability of this rule to specific 
problems depends on the possibility of determining the multipliers X». 
The existence of the X* was established above only under the hypothesis 
that J 7 ^ 0, 


Example: As an illustration, consider the problem of determining the maximum and 
the minimum distances from the origin to the curve of intersection of the elUi>8oid 

2* 

-L ^ « 1 

with the plane 

Ax -i- By -h Cz 0. 

The square of the distance from the origin to any point (x,y,2) is 

/ - X* + -f 

and it is necessary to find the extreme values of this function when the point ix,y,z) is 
common to the ellipsoid and the plane. The constraining relations are, therefore, 


2 ^ 

:-h — H- 

a* ^ ^ 


1 


(a) 
and 

{b) ip 2 4" By -+* C2 »• 0. 

The function F f A- Xm + M<P 2 is, in this case, 

^2 ^ 


F - X* -f V® + + Xi -f ^ -b ^ - 1 ) + 2Xa(Ax + -f Cz), 
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where the factor of 2 is introduoed in the last term for convenience. Equations (11-10) 
then become 

ar -f* -r 4 * ■*« 0 , 

a* 


(c) 


y 4- Xi p 4 \tB ■> 0, 


* 4“ Xi -r -4 X 2 C *» 0, 


These equations, together with (o) and (6), give five equations for the determination 
of the five unknowns x, y, z, Xi, and X2. If the first, second, and third of equations (c) 
are multiplied by x, y, and z, respectively, and then added, there results 

( J.2 -.2 

^ ^ “ 0- 
Making use of (a) and (t>), it is evident that 


X] - -( 2 * + V® + r*) ■= -/• 
Setting this value of Xi in (c) and solving for x, y, and z. 


V ^1 — + XjB = 0, 

* (j - ^-) + = 0. 


X24o^ 


rt® — 


/ 


MCc^ 


,2 -/ 

When these values of x, y, and z are substituted in (6), one obtains 

4V B^b'^ 
a* — 7 6^ — / ~ / 

from which / can be readily determined by solving the quadratic equation in/. 


PROBLEMS 

1 . Find the point P, in the plane of the triangle ABC, for which the sum of the dis¬ 
tances from the vertices is a minimum.^ 

2. Find the triangle of minimum jxTimeter wliich can be inscribed in a given triangle. 

12. Taylor’s Formula for Functions of Several Variables. Let f(x,y) be 

a function of two variables x and y that is continuous in the neighborhood 
of the point (a,6) and that has continuous partial derivatives, up to and 
including those of order n, in the vicinity of this point. 

* See E. Goursat’s ^‘Mathematical Analysis,'' English ed., vol. 1, p. 130, for a detailed 
discussion of this interesting problem. 
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If a new independent variable t is introduced with the aid of the relations 

X = a + at, j/ = 6 + |3/, (12-1) 

where a and /S are constants, a function of the smgle variable t will result, 
namely, 

m = f{x,y) = /(a -1- at, 6 -i- 0t). (12-2) 

Expanding F{t) with the aid of the Maclaurin formula gives 

E"(0) , 

F{t) = F{Q) -Jr F\0)t -1- t* -f • • • -f-f, 


2! 

where 0 < 0 < 1. 

It follows from (12-1) and (12-2) that ‘ 


n! 


(12-3) 


dx dy 

F\t) =/x(x,t/)-+/,(x,i/)- 
dt dt 

== fx{x,y)a + fyix,y)p. 

Calculating F'\t) and from this expression gives 

dx 

F"(t) « [fzx(X,y)a + fyx(x,y)^] ”7 + [fxy(x,y)a + fyy(X,7j)^} 


and 


dt 

= fxx{x,y)a^ + 2fxyiT,y)a0 + fyy{x,y)0~, 
F'"(i) = {fxxx(x,y)a^ -f 2f,„Ax,y)a^+/^^x(T,y)fi^} 


dy 

dt 


dx 

dt 


+ [Sxxv{x,y)a^ + 2Sxyyix,7j)a(i Jyyy{x,y)^^\ 




dt 


= Jxxx{x,y)o? + ^xxv{x,y)a^& -f 3/xj,„(x,2/)aj3^ + fvuviXyV)^. 

Higher-order derivatives of Fit) can be obtained by continuing this 
process, but the form is evident from those already obtained. Symbolically 
expressed, 


it) = (a ^ -t- /3 ^)/(x, 3 /) s a 
\ dx dy/ 

{ d dV 


F"(t) 

F"'{t) 


( d a V 

[a — + fi — )f{x,y) 
\ dx dy/ 


df df 

dx dy 

a2---f2ad— 

ax* dx dy dy^ 

d^f 

a*-4 + 3a*^ 


ax* 


a®/ 

ax* dy 
+ 3a(S* 


a®/ 


dx dy^ 


+ , 


a®/ 

ay® 


^ See Sec. 4. 
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a a \“ 97 , , a 7 

dx dy/ dx” dx" ^ ( 


dx” dx^-^dy 

d^f d^f 

dx dy ^ dy^ 


where 


r!(n — r)! 

Since t = 0 gives x = a and y == 6, 

F(0) = /(a,6), F(0) = aUa,b) + 0f,{a,b), .... 

Substituting these expre-ssions in (12-3) gives 

Fit) ^fix,y) =/(a,6) -f [<4Mb) + PUam 

-H [«='/xx(a,6) -b 2a»hy{a,b) -b ^%yia,b)] L + ...+ r„, 

ib I 


r / a a \" 

where - la—-b/S—)j 

n! \ dx dy/ 


f{a -b dat, b -b eaO- 


Since at — x — a and — y — b, the expansion becomes 
f(x,y) =fia,b) +f^[a,b)ix - a) +/„(a,6)(y - b) 

+ [fxxia,b)ix - a)^ -b 2fxp{a,b)ix - a){y - b) + fvv{a,h)iy - b)^] 

+ •■•+«„. (12-4) 

This is Taylor's expansion for a function f(x,y) about the point (a,6). 
Another useful form of (12-4) is obtained by replacing x — a by A and y ~ b 
by k, so that x = a + h and y = b k. Then, 

/(a + h,b-\-k) = f(a,b) + fiia,b)h -f jyia,b)k 


+ -[/xx(o,5)A" + 2Uyia,b)hk + Jyy{a,b)k^] 

M ! 


where R 




3 ^ \n 

^-j_ ^ lly 0/c). 

dx dy/ 


+ ---+ftn, (12-5) 


n! \ dx dy/ 

This formula is frequently written symbolically as 

/(a + h,b + k)= fia,b) + (h^ + k Pjfia,b) 

\ dx dy/ 


1 / a aV 
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In particular, if the point (a,6) is (0,0), the formula (12-4) reads 

Rx,y) - /(0,0) + /^(0,0)a: + /^(0,0)j/ 

+ ^ [/*x(0,0)x* +2Uy{0,0)xy + f,y{0fi)y^\ 

+ * * * + Rny (12-6) 

1 / a ay 

where i?n - —\x - h y —) K^xfiy), 0 < a < 1. 

n! \ dx by/ 


This development is known as the Madaurin formula for functions of two 
variables. It is seen from (12-6) that the Madaurin formula expresses 
the function f(x,y) in a series each term of which is a homogeneous poly¬ 
nomial in X and y. 

The procedure outlined above can be generalized easily to yield similar 
expansions for functions of more than two variables. 

Example: Obtain the expansion of tan“"^ iv/x) about (1,1) up to the third-degree terms: 


y 

f(x,y) « tan * 

x 

/(1,1) = tan-* 1 - 



-2- 


fxz(x,y) » ^ 

II 


V'.i) = 0; 

, , , -2x1/ 





PROBLEMS 

1 . Obtain the expansion for x]/^ -f eosxi/ about (l,7r/2) up to the third-degree terms. 

2. Expand/(x,7/) ~ at (1,1), obtaining three ^^erms 

S. Expand e* cos y at (0,0) up to the fourth-degree terms. 

4. Show that for small values of x and y 

sin y ^ y xy (approx), 

and log (1 -{■ y) ^ y ^ jty --- (approx). 

5. Expand /(x ,?/) « x^y -f x% -f 1 about (0,1). 

6 . Expand Vl — x^ — about (0,0) up to the third-degree terms. 
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7. Show that the development obtained in Prob. 6 agrees with the binomial expansion 
of [1 -- (a:* 4- 


INTEGRALS WITH SEVERAL VARIABLES 


13. Differentiation under the Integral Sign. The fundamental theorem 
of integral calculus states that whenever f(x) is a continuous function in 
the closed interval (a,6) and F{x) is any function such that F'{x) = /(x), 
then 

r f{x) dx ^ Fill,) F(uo) (13-1) 

Juq 

for any two points Uo and uj in the interval. If and Ut are differentiable 
functions of anotlier variable a, so that 


Ui) = Wo(of), Ui = Ui{a), 

the right-hand member in (13-1) is a function of a and the chain rule gives 
dF (ui) dill (inI 

da da da 


Since a similar result holds for differentiation of (13-1) yields the 

important formula 


d 

da 


ru^(a) dui 

/ ^ fix)dx^f(ui) — 
•'tio(a) da 


- /(wo) 


da 


(13-2) 


If the variable a in (13-1) occurs under the integral sign, so that the 
integral takes the form 

^(«) - r f{x,a) dx, (13-3) 

vve can compiite the derivative of <^(«) by calculating the limit of the 
difference quotient as Aa 0 Tins calculation is simple when 

the limits //j are constant. Indeed in this case (13-3) gives 


r^i f^i 

Aip = ^(a -f Aa) — ^(a) = / f{x, a + Aa) dx - / f(x,a} dx 

■'«„ ■'ko 

= T’ [/Or, a + Aa) - /(.r,a)] dx. 

Juo 

Dividing by Aa and taking the limit as Aa 0 give 

<p(a + Aa) — (p(a) nhf{x, a + Aa) — f{x,a) 

(p'(a) ^ lim--= hm /- dx 

0 Aa Aa -* 0 ■'Wo Aa 

(13-4) 

provided the limit on the right exists. 
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If we knew that 

f“i f(x, a + Aa) 


lira /“* 


• fix,a) 


Aa •'^6 Aa 0 

then the right-hand member of (13-4) would give 

r«, df 


, f"! f(^f a+ Aa) - fix) 

ax ~ / bin - ax, 


Aa 


(13-5) 


L 


**o da 


dx 


by the definition of partial derivative. We could then conclude that 
d r^i r«i 

— / f{x,a) dx — I fa(x,a) dx, Uq, Ui const. (13-0) 
da •'“0 -'“0 


Interchanging an integral with a limit operation as in (13-5) is not valid 
in general/ but the equality of (13-5) can, in fact, bo justified when 
is continuous, and hence (13-6) holds in that case. 

Equation (13-2) requires that the integrand be independent of a, while 
(13-6) assumes that the limits of integration are indef>endent of a. 

Wlien the limits and also the integrand depend on a, it can be shown * 
that the correct formula is given by addition of (13-2) and (13-6); namely, 


W, (a) 


dui 

, , , , = /(«!,«) — 

da ‘'wo(a) da 


/. 


fiuo,a) -T-+ , , Ux,«) dx (13-7) 

da -'wo(a) 


provided that u^ia) and Ui(a) are differentiable and f{x,a) and fa{x,a) 
are continuous. The formula (13-7), known as Leibniz's formula, will now 
be illustrated by several examples. 


Example 1. Evaluate — 

da 

(13-6) yields, when a ^ 0, 


I log (x* + a^) dx. Inasmuch as the limiUi are constants, 

™ f log (x^ -f ot^) dx » /" ~ TT^ 
da Jo Jo rr or 


The resulting integral is easily evaluated by the fundamental theorem of integral calcu¬ 
lus, since 

:(2tan-‘3 


jd 

dx ' 


We thus obtain 


2a 

X* 4- or* 


da 


f log (x^ 4- «*) dx » 2 tan ^ ~ I «» 2 tan"”* -• 
Jo a 10 a 


' The reader can verify that 

1 2x 


r* 1 2x 1 

Urn / -- rr ~% dx 7 *^ / lim -- 

0 do log a x^ 4" « Jo <* log a 


2x 


log a X* 4- 


dx. 


•See I. S. Sokolnikoff, “Advanced Calculus/’ pp. 121-122, McGraw-Hill Book Com¬ 
pany, Inc , New York^ 1939. 
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dx, find ip'ix) first by evaluating the integral and then 

0 

differentiating, and also by the Leibniz rule. To avoid confusing the parameter x aji- 
pearing in the limits of the integral with the variable of integration x, we write 

vj(x) « f i'^dt =« }4t^ j « }^x, 

Jo lo 

Hence tp'ix) *■ other hand, the application of the rule (13~2) jdelds 


<p'{x) 


dx 


f 

Jo 


edi = (x^)2 


dx 


thus checking the result previously obtained. 

Example 3. Find d^pjda if ip{a) = / Since the integiand and the limits 

J a 

in this integral are functions <>i a, we use formula (13-7} I'hen 


dip 

da 


/ -« 2x^ 

_ + t-\2) - e-“’(-2a) 

- 

/■*" 2x^ 2e~i + 2ae-“\ 

2-«* 


The integral appearing in this expression cannot be evaluated in a closed form m terms 
of elementary lunctions, but it can be readily computi^d in infinite senes (see Chap 2, 
Sec. 10) 

Example 4 Formula (l3-(>) can .sometimes be used to evaluate definite mteg“j,ls. 
Thus consider 

ip{a} — / - dx, Of > 0. 

Jo log X 

Differentiating under the integral sign, we get 

x^ log X f\ 

ip (a) - / - dx ~ j X^ dx. 

Jo log X Jo 

The evaluation of the integial is ea.sy, and we find 


(13-8) 


p(a) = — ■ 


1 


Integrating again we get 


-h 1 lo a -f I 
<p(a) = log (a -|- 1) 4- e. 


(13*9) 


To evaluate the coirstant c, we note that for a * 0, (13-8) gives ^(0) = 0 while (13-9) for 
tt sa 0 requires that v’(O) = log 1 + r. Hence c =*= 0 We finally iiave 


r 


logo: 


dx « log (a -f 1). 


PROBLEMS 

1 . Find — if <f>{a) ^ / sin ax dx by using the Leibniz formula, and check your 
da Jo 

result by dirc^ct calculation. 
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а. Find ^ if - [”{1 - a cos x)* dx. 

aa Jo 

8. Find ^ if ^a) - T tan"' dx. 
aa Jo oi^ 

4, Find ^ if v>(«) *■ ^tan {x — a) dx, 
da Jq 

б. Find ^ if ^(x) *» f Vx dx. 

dx Jo 

6. Show in the manner of Example 4 that 

« / log (1 + a cos x) dx «* IT log 
Jo 


1 


7. Differentiate under the sign, and thus evaluate 
dx V 

a — COS X (a* — 1) H 




dx 


I 


{a ~ COSX)^ 

if > 1. 


if a* < 1. 


by using 


8 . Show that 


9. Verify that 


I log (1 — 2a cos X 4* dx « 0 if < 1 
Jo 

~ IT log a*^ if a^ > 1. 

1 r 

y = 7 / /(«) f^in k{x — a) da 
k Jo' 


is a solution of the differential equation 

d^y 


dx* 


4 k^y « fix), 


where is a constant. 


14. The Calculus of Variations. Physical lawH can often be deduced 
from concise nmtliernatical principle's to the effect that certain integrals 
attain extreme values. Thus, the Fermat principle of optics asserts that 
the actual path traversed by the light particle is such that the integral 
representing the travel time between two points in every medium is a 
minimum. Also a considerable part of mechanics can be deduced from 
the principle of minimum potential energy, stating that the equilibrium 
configuration of a mechanical system corrcwSponds to the minimum value 
of a certain integral related to the work done on the system by the forces 
acting on it. For example, the shape assumed by a flexible chain fixed 
between two fixed points is such that its (‘enter of gravity is as low as 
possible. To say that the center of graxuty is as low as possible is equivakmt 
to saying that the potential energy of t.he system is as small as possible. 

T)ie problems concerned with the determination of extreme values of 
;integrals whose integrands contain unknown functions belong to the calculus 
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of variations. The simplest of sucli problems concerns the determination 
of an unknown function y = y{T) for which the integral 

I j ' F{r,y,y’)dx (14-1) 

•'^0 

between two fixed points PoC<"(»jyo) P\{ocuyi) is a minimum. The 
function F of the variables x, y, and //' ^ dy/dx is assumed to be known. 

If we imagine that the points Po ^md Pi in the xy plane are joined by 
a sutBciently smooth curve y = /(.r), then tlie substitution of y - f(x) 
and t/' fix) in the inlegrand of (It-l) yields the integral 1(f) wliose 
value, ordinarily, depends on the choice of the curve y — f{x). We ask 
the question: What is ttie e<|uation of the curve y = yix) joining Pq and 
Pi which makes the value of the integral fl4-l) a minimum? To be certain 
lliat this <]uestion makes sense, it is necessary to impose some restrictions 
on the integrand in (14-1) and to specify how the curves that enter in 
competition for tine minimum vmlue of / are to l)e chosen. 

W(‘ shall supposf* that F{c,y,y'), viewed a.s a function of its arguments 
X, y, and y\ has continuous partial deruatives of the second order, and 
we assume that tlien* is a cuiwe y ~ yix) with continuously turning tangent 
that minimizes the integral. We tlcm choo^(‘ tlie competing family of 
curves a.s follows. 1.(4 y — yix) be any lumtion with continuous second 
d(*nvativ(ss which vani.sh(\s at the 
end points of the interval (.ro.Xi). 

Tlien 

yix,,) = 0, yix,) - 0 (14-2) 

If a is a small parameter, 

yix) = yix) -f ar 77 (.r) (14-3) 

represents a family of curves passing 
tiirough iX(),yfi and (. 14 , 7 / 1 ), since 
the minimizing curve y = yix) 
passes through these points and 
^iCro) v(^i) ~ situation 

here is that indicated in Fig. 10 The vertical deviation of a curve in the 
family ( 11 - 3 ) from the minimizing curve is (xyix); it is called the variation 
of 

Now if we substitute y and if from (14-3) for y and f in the integral 
(14-1), we get a function of a: 

J(a) - f ^ Fix, y(x) + (xyix), ?/'(.r) + a7?'(a‘)] dx, (14-4) 

Jxo 

« 0, Eq. (14-3) yields y(x) = y(x), and since y =* y(x) minimizies 
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the integral, we conclude that /(a) must have a minimum for a =* 0, A 
necessary condition for this is 


dl 

da ot«.o 


- 0. 


(14-5) 


We can compute the derivative of 1(a) by differentiating (14-4) under the 
integral sign and get 

r(a) = r — F(xJ,Y') dx, (14-6) 

“'^0 da 

where we have set, 


Y S y{x) H- avix), Y' = y'(x) + ari'{x). 
But by the rule for the differentiation of compoMte functions, 


(14-7) 


da 


dF dY (IF !iY' 
d Y da dY' da 


dF 

dY 


dF 

nix) + n'{x), 

a 1 


BO that (14-6) can be written as 

OF 

•'j'o 1 OY 


OF 

r(a) -/ + 


OF 

OY 


:v'M 


dx. 


Since /'(O) = 0 by (J4-5), we get, on setting tv = 0 in (14-8), 
OF OF 

V 


/•X, (It 

■'^0 L Oif 


dx == 0, 


(14-8) 


(14-9) 


because for a == 0, it is evident from (14-7) that Y = i/(x), F' = y'(x). 
The second term in the integral (14-9) can bo integrated by parts to yield 


rx OF OP rr, a /dF\ 

I r,’{x)dx^--;r,(x) - --(--) 

•'*0 dy dy ij, •'■'(1 dx \dy / 


dF 


d /dF\ 


dx 


= - p nix)— F-d^dx 
Jn dxXdy'J 


dy'. 

since the integrated part drops out because of (14-2). Accordingly, we 
can write (14-9) as 

/•*! r dF d (dF\' 

/ 7,(x) dx = 0. (14-10) 

•'*0 L dy dx \dy / J 

But it(x) is an arbitrary function vanishing at the end points of the interv^d. 
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Since the integral (14-10) must vanish for every choice of i 7 (x), it is easy to 
conclude that ^ 


dF d /dF\ ^ 
dy dx \dyV 


(14-11) 


This equation is called the Euler equation. On carrying out the differentia¬ 
tion indicated in (14-11), we get the second-order ordinary differential 
equation 

dF d^F 




dx dy' dy dy' 


y 


dy'^ 


(14-12) 


for the determination of the minimizing function y{x). 

The general solution of (14-12) contains two arbitrary constants which 
must be chosen so that the curve y - y(x) passes through (ro,2/o) and 
(xuVi). 

It should be noted that the solution of Euler’s equation (14-11) may not 
yield the minimizing curve because the condition (14-5) is necessary but 
not sufficient for a minimum. Ordinarily one must verify wdiether or not 
this solution yields the curve that actually minimizes the integral, but 
frequently geometrical or physical considerations enable one to tell whether 
the curve so obtained makes the integral a maximum, a minimum, or 
neither. 

Similar calculations when performed on the integral 

Hy) = f ' f^{x,y,y',y '',.. ..j/'"') dx (14-13) 

Jxq 

yield the Euler equation 

d iF 

n - r + : , n-(-1)” T5i == (14-14) 

dx dx^ 


The foregoing discussion can also be generalized to the problem of minimiz¬ 
ing the double integral 

I (“) = dx dy, (14-15) 


in which the competing functions u{x,y) assume on the boundary C of 
the region R preassigned continuous values u = ^(s). If it is supposed 

^ The proof m by contradiction. Assume that the function in tlie brackets of (14-10) 
is not zero at some point x « ^ of the intt^rval (xo,xi). Then since it is a continuous 

d 

function, there will be a subinterval I about x » ^ throughout which Fy — Fy> has 

dx 

d 

the same sign as at x =« Choose ij{x) so that it has the same sign as — - 7 - Fy^ in I 

dx 

and vanishes outside this subinterval. For such a choice of ti(x), the integrand in (14:-10) 
will be positive, and thus the integral will fail to vanish as demanded by (14-10). 
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that Fj viewed as a function of x^y^u^Ux ^ &u/dx^ Uy s du/dy^ has con¬ 
tinuous second partial derivatives with respect to these arguments, the 
Euler equation corresponding to the integral (14-15) turns out to be 


Fu 


dFu, dFu 

dx dy 


(14-16) 


A special form of the integral (14-15) is of particular interest in the study 
of the Diriehlet problem, which occurs in numerous applications.^ It is 


/(») = [(mx)^ + + 2/(3:,?/)w1 dx dy, (14-17) 

where/(:r,y) is a known function. The substitution of F = + {uy)'^ 

+ 2fu in (14-16) yields the Poinson equation 

V^a == f{x,y). (14-18) 

It can be shown that the solution of this e<iuatioii,“ assuming specified 
continuous values u = <^(s) on the bo\indarv C of the n^gion, actually 
minimizes the integral (14-17) on the set of all (‘ornpeting functions which 
take on C the same boundary values ip{F). 

Example' What is the equatum of the rurvi* // — yix) for which the area of the surface 
of revolution got by revolving the curve about the x axis is a mmimum? 

The integral to be inimmized in this priiblem js 

I ^ 2Tr f IJ (is ^ 2Tr f yVl -f y'^ (Ix, (14-19) 

Jxi) Jx(, 

It has the form (14-1) with 

F{x,y,y') =» 2rryV\ 4 - ( 11 - 20 ) 

The substitution from (14-20) in the Kuloi equation (14-11), after simple calculations, 


yields 

Vi= 

dxVl y'^ 

or 

yy" -* 2 /'" -1-0. 


This second-order equation is easily solved by setting 1 / » p, y" » p dp/dy (cf. Chap 1, 
Sec. 13). The result is 

X — 02 

y ^ Cl cosh-; (14-21) 

ri 

so tliat the desired curve is a catenary. The mU^gration constants oi and ca in the general 
solution (14-21) must be determined so that the curve passes through given points (xo,l/o), 

* Se^e Chap. 6, Sec. 12, and Chap. 7, Sec. 21. 

* See, for example, 1. S. Sokolnikoff, '^Mathematical Theory of Elasticity,'' 2d ed,, 
Boc. 106, McGraw-HilT Book Company, Inc., 1956. 



SEC. 16] 


INTEGEALS WITH SEVERAL VARIABLES 


269 


PROBLEMS 


1 . Show that the curve of minimum length joining a pair of given points in the plane 
is a straight line. Hint: Minimiz*' 


/' 

■'xn 


Vl + n'-dx. 


2 . Solve Prob. 1 by taking 




dr. 


3, When a bead .slides from n'st along any smooth curve C from tlie point P to a point 
Q on C, the speed v of the beatl i.s c — \vh(‘re h is (lie vertical distance from P to Q. 

^ ' ('hoose P at the oiigin, and show that 

Jp V 


Hence the travel time from /* to Q is / 


the curve for which the travel time i.s a minimum is a cycloid. 

rn y / j ■4.. ,^^2 

4. Consider the integral 7 ^ / - - 

Jxo _ V 

the associated Failer’s t*quation is -h fr — ciP ^ co. Discuss this solution. 
6. Obtain Euler’s equation for tlie int<*gial 


dx, and show that the general solution of 


/(//) 


?/) ^ j -h q{j)>r d 2f{j)q]dx, 


Special eases of this integral arise in the study of deflection of bars and brings 

16. Variational Problems mih Constraints. Occahionally one seeks a 
maximum or minimum vahie of the integral 


/ = f ‘ (It, (15-1) 

’'^0 

discussed in the preceding section, .suhjecl to the condition that another 
integral 

J f ' (/(./%(/,/y') dx (15-2) 

have a known constant value A physical problem of this sort has alread,y 
lK*en mentioned in Sec. II where U was letiuirt'd to find the shape of the 
chain w^hich minimizt's the potential energy^ while, the length of the chain 
is given. This is one of the so-called isoprrimeirir problems of the calculus 
of variations.^ 

It is natural to attempt t.o solv^e the problem I = min subject to the 
condition J = const by the method of Lagrange multipliers. We construct 
the integral 

I + \J ^ f ' [Fix.yy) + XG(j,2/,.v')] dx 

Jxq 

^ Isoperimetric liecause the length (or the perimeter) of the curve is given. 


(16-3) 
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and consider the free extremum of the integral (16-3). The corresponding 
Euler’s equation (14-11) is 

d(F + \G) dd(F + XG) ^ 
dy dx dy' 

and on carrjdng out the indicated differentiation * in (15-4), we get the 
second-order ordinary differential equation containing the parameter X. 
The general solution of this equation, in addition to X, will contain two 
arbitrary integration constants. The integration constants and the param¬ 
eter X must then be determined so that the curve y = y(x) passes through 
the given end points and satisfies the constraining condition (15-2). 

The justification of this procedure is based on an argument similar to 
that used in Sec. 14, where instead of the one-parameter family of the 
neighboring curves (14-3) one constnicts a suitable two-parameter family.^ 
16. Change of Variables in Multiple Integrals. The reader will recall 

that the double integral JJ^^f(T,y) dA of a continuous function J{x,y) spe¬ 
cified in a closed two-dimensional region R of the xy plane is defined as the 
limit of the sum formed in the following way: The region R is subdivided 
into n elements of area and the value of f{x,y) is computed at some 

n 

point (ft,^,0 of the A^,; the sum 21/(Ji,^») AA,- is then formed, and its 

lasrl 

limit is calculated when the number of elements AAt is allowed to increase 
indefinitely in such a way that the greatest linear dimensions of the ele¬ 
ments tend to zero. Thus, 

r ” 

I f(x,y) dA Wm 22/U..’?.) (16-1) 

The calculation of the limit in (Ki-l) is usually performed by repeated 
evaluations of two simple integrals, so that 

r ror^h 

l^f(x:,y)dA I I f{x,y)dydx. (lG-2) 

The limits in (16-2) are determined from the equations of the boundary 
of the region (Fig. 11). The triple inttigral of f{x,y,z) is defined similarly, 
by subdividing the three-dimensional region R into volume elements Ar^ 
and by forming the corresponfling sum. Thus 

n 

]j(.x,y,z)dT= lira At,. (16-3) 

n OO jsarJ 

* Compare Eq. (14-12). 

•See G. A. Bliss, ^'Calculus of Variations,” Carus Monograph, The Open Court Pub^ 
tiflhing Co., LaSaUe, liL, 1925. 
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The limit of the sum in (16-3) is usually evaluated by repeated single in¬ 
tegrations. One can write, for example, 


l^f(x,y,z) dr = j I ...I , f(r,y,z)dzdydx, (16-4) 

JR JXrraa Jy-^hy{x) l(X , J/) 


in which the integration limits are determined from cciuations of the 
bounding surfaces. 

The evaluation of multiple integrals can frequently be simplified by 
making approf)riate cluingcs of the inde¬ 
pendent variables. Thus in dealing with 
double integrals, it may prove advanta¬ 
geous to replace x and // by new variables 
a and v related fo x and y by the trans¬ 
formation 




(16-5) 



with suitable properties 

\V(‘ shall suppose that the functions 
/, in (16-5) havT^ continuous hist paitial derivatives in the region R and 
that the Jacobian of (16-5) 




du 

dv 

(Ir 

dx 

du 

dv 

^y 

dy 


(16-6) 


does imt vanish in tlie region R. In this event, Kqs. (16-5) can be solved 
for .r and y to yield the difT(‘rentiable solution * 


X - iPliUyV), 

y == 


(10-7) 


If u and V are assigned some fixed values, say i/o and t^o, the equations 


Wo = /i(.r,2/), 


Vo = 

determine tw'o curves w^hich will intersect in a point (xo,2/o), such that 


^0 = <pi('f^otVo)y 


yo ~ ^ 2 (wo,*^o)* 

‘ See, for example, J. S. Sokolnikoff, “Advanced Calculus,'’ cha{). 12, McGraw^-Hill 
Book Company, Inc., New York, 1939, 
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Thus the pair of numbers (wo,po) determines the point (a;o,2/o)> m the xy 
plane (Fig. 12). 



If u and V are assigned a sequence of constant values 
(ll2,V2), iUn^Vn), 

a network of curves will be determined that will intersect in the points 

(•ri, 2 /l), {-^ 2 Ah)r (-^3,2/3), C^’n, 2 /n), .... 

Corresponding to any point whose rectangular coordinates are (:r,^) there 
will be a pair of curves u = const and v = const, vhich pas.^ through this 
point. The totality of numbers {v,v) deiinos a curvilin(‘ar (‘oordinate 
system, and the curves themselves are called the coordinatf lines. 

Thus, if 

M = 

V = tan 

X 

the family of curves u = const is a 
family of circles whereas v — const 
defines a family of radial lines, d'he 
curvilinear coordinate system, in 
this case, is the ordinary polar 
coordinate system (Fig. 13). 

In the cartesian xy coordinates 
tile element of area dA == dx dy is 
the area of a rectangle formed by the intersection of the coordinate 
lines X ^ To, X ^ xq + dx^ y ^ yo, y - yo + dy, as shown in Fig, 14. In 
the curvilinear uv coordinates the element of area dA can be visualized as 
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the area of the quadrilateral formed by the intersection of the co¬ 

ordinate lines u ^ Uo^ u ^ 'iMi + du^ V ^ V ^ vq + dv, shown in Fig. 15. 



The oxpreRsioTi for tlio eleineni of area dA in curvilinear coordinates {u,v) 
can be calculated with the aid of Kqs. (16-7), but it is soniewiiat simpler 
to follow the method of Sec. 2, Chap. 5 (see, in particular, Eq. (2-17) of 
that section) to show that 

dA == |/(w,i')| dudv, (10-8) 


where 


dx 

dy 

dii 

du 

dx 

^y 

dv 

dv 


(16-9) 


is the Jacobian of the transformation (16-7). 

The double integral in (lt)-2) can then he evaluated in the iw coordinates 
by substituting in f(x,y) from (16-7) to obtain f\(Pi{a,v), <P2{u,v)] ^ F{u,v) 
and wTiting dA in the form (10-8). Thus, 

lfU,y)dA == / / F(u,v)\J{UyV)\ du dv, (16-10) 


The limits of integration in (10-10) are determined from the equations of 
the boundary of R referred to the iw coordinates. 

Similar considerations apply to a change of variables (x,y,z) in the triple 
integral (10-4) by the transfonnation 

u = /i(a;,.v.2), 

y = h{x,y,z), 


(16-11) 
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with the Jacobian 


J{x,y,z) = 


If the solutions of (16-11) are 


du 

dv 

dw 


dx 

dx 

dx 


du 

dv 

dll) 


— 

— 

— 

9^ 


^y 



du 

dv 

dw 


dz 

dz 

dz 


X = 


{u,v,w), 

y = 

^2 

(u,v,w), 

z = 

<f>z{u,V,w), 


(16-12) 


(16-13) 


the element of volume rfr in the umv system can 1)0 taken as ‘ 
dr = \J{ii,v,w)\dudv due, 


(10-14) 


where J{u,v,w) is the Jacobian of the transformation (16-13), so that 


J{u,v,w) 


dx 

dy 

1 

1 

du 

flu 

du 

dx 

Sy 

dz 

dv 

dv 

dv 

dx 

dy 

dz 

dw 

flw 

dw 


(16-15) 


Example 1. it be required to 6nd the moment of inertia of the area of the circle 

(Fir. 16 ) 

_about a diameter of the circle. It is con- 

p=a cos B voment to introduce the polar coordinates 

X « p cos 

j/ * p sin B, 

so that the equation of the circle becomes 
p « a cos B, 

Calculating the determinant J gives 

_ j 1 cos B sin 0 j 

" t ^ I — p sin 0 p cos I 




dA ^ p dp 


Pia 16 

* Cf. Chap. 5, Sec. 2. 


BO that 




> See Chap, 5, Sec. 1. 


Fig. 17 
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These equations are in the form (16-13) with p « ^ « w, and it is easy to verify 

that (16-14) yields 

dr sin 0 dp dIB d<i». 


Consequently, the substitution from (16-16) gives 

I X dr ^ I I p sin 0 COS <t>p‘^ sin G dp d$ d4> 

JR J ffi »»0 - 6 «i0 • p ««i0 

ira^/16 Sfl 

^ as-SSB, - 

jra'^/6 8 


Thus, 


ra* 


PROBLEMS 


1. Use cylindrical coordinates (r,^, 2 ) defined by 

X = r cos 0, y ^ t SHI 6, z z, 

to compute the moment of inertia of the volume of a right-circular cylinder of height h 
and radius a about its axis. Also evaluate the integral in cartesian coordinates 

2. Compute the expression for dA in forms of u and v it 

X ^ u{l — v), y uv. 

3. ConipuU‘ the expression for dr in terms of a, e, and w if 

j = a(l — r), y ~ ui\ z =* uvw. 

4* Show that in the cylindrical coordinati'^ ol Prob 1, the elianeiit of volume dr =» 
r dr do dz, 

6. Use the cylimliKMl coonlmates of Piob 1 to find the volume enclo^eil by th<' circu¬ 
lar cylinder r — 2a cos 0, the cone z *= r, and the plane z — 0. 

6* Evaluate e~ dy dx, where 72 is the r(‘gion boundiHl by the circle + //“ — 

Jr 

Use polar ciaordmates 

7. P"md the area outside p = < 2(1 -f- cosO) and inside p *= cos ^ 

8. Find the cooidmates of the center of gravity of the an^a l>etween p ~ 2 sin 0 and 
p « 4 sin 0, 

9. ('aiculate the elements of area in the uv coordinate systeiiLS which are related to 
the cartesian coordinate systemxy by means of the following equations of transformation: 

(а) X » u -f a, “ V -p ?>; 

(б) X « au, y ^ bv; 

(c) X « a cos a ■— V sin a, y ^ u sin a -p r cos a; 


where a, 6, and a are constants. Int-erpret your results geometrically 

10. What are the regions of integration m the uv coordinate systems of Prob. 9 if the 
region R in the xy plane is the interior of the ellipse 




1 ? 


tl. Discuss the curvilinear coordinate system defined by the relations 


X “ u -P v, 


y « u — ti; 
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and describe the region in the uv plane corresponding to the square x ^ ^ 2, y 

y ^ 2. 

12. Discuss the curvilinear coordinate system defined by the relations 
u ~ y\ V 2xy. 
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Sketch the curves u * const and v « const. 

13. Sliow that the attraction of a homogeneous 8ph(‘re at a point exterior to the sphere 
is the same as though all of the mass of tin* sphere were concentrated at the center of 
the sphere. Ai«3umc the inverse-square law of force*. 

14. The Newtonian jiotential V, due to a body 7’, at a point P is defined by the equa¬ 
tion V(P) a* / ~ —, wheie dm is the element r>f mass of the body and r is the distance 

Jt r 

from tlie point P to the element of maas dm. Show that the potential of a homogeneous 
sfiherical sludl of inner radius b and outer radium a is 


V « 2ir<r(a- - 5-). if r < 6, 


and 



if r > a, 


where a is the density. 

16. Find th(‘ Newtonian {lotential on the axis of a homogeneous circular cylinder of 
radius a 

16. Hhow' tliat the force of attraction of a right-circular coni' upon a point at its vertex 
IS 27r<r/?(l ”*■ eos a), where h is the altitude of the cone and 2 q i.s the angle at the vertex 

17. Show that the force of attraction of a homogc'iieous right-circular c> finder upon a 
p(»int on it^s axis is 


2irff\h + Vli^ + a= - VOr+h)^ + a^], 


where h is altitude, a is radius, and R is the distance fiom the point to one base of the 
cylinder. 


17. Surface Integrals. A surface is usually defined as a locus of points 
determined by the equation 

2 == (17-1) 

where fU',y) is a continuous function specified in some region of the xy 
plane. Tins definition, however, is too broad to permit one to formulate 
a meaningful (‘onc(‘pt of the surface area. Since mosi surfaces encountered 
in applications are t\vo-sided and piecewise smooth, we confine our con¬ 
siderations to such surfaces only. 

The surface defined by (17-1) is called fimoofh if it has continuous partial 
derivatives dz/dx and dz/dy at each of its points. This implies that a 
smooth surface has a continuously turning tangent plane and hence a 
well-defined normal at each of its pointsd 


^ We recall that the equation of the tangent plane to (17-1) at a point P(a;o,j/o,2k)) is 

> that the direction of the normal at P is determined by the ratios (”—) :(—) ’—1 

\dx/p \dy/p 


(cf. Sec. 10, Chap. 4). 
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The surface is said to he piecewise smooth if it can be subdivided b, 
smooth curves into a finite number of pieces, each of whicli is smooth 
Thus, the surface of a cube is piecewise smooth. 

The surface is two-sided when it is possible to paint it with two differen 
colors to distinguish the sides.^ If two oppositely direcU^d normals PI 
and PN' (Fig. 18) are drawn at a point P of a smooth two-sided surfac 
and P is allowed to move along any path that lioes not cross the edge r 
the surface, the direction of PN can never be ]>rought into coincidenc 
with PN', 



It is intuitively clear that a small element of a smootli surface is nesirl 
fiat, so that a neighborhood of any point on it is well approximated by 
portion of the tangent plane. This observation suggests a procedure fc 
constructing a meaningful definition of the arc^a of a smooth surface. 

Thus, let S' be a smooth portion of the surface N bounded by a close 
curve C (Fig. 19). We shall suppose tliat S' is such that every line paralh 
to some coordinate axis (say the z axis) ctits S' in just one point. If th 
projection C' of C on the xy plane encloscis the region R, we can subdivid 
R into n small subregions AR^ by the families of straight lines parallel t 
the X and y axes. The planes through these lines, normal to the region i 
cut from S' small regions AS[ of areas Aa,. Let, AA, be tlie area of AR 
The projection of Aat on the xy plane is, approximately, 

AA t = cos 7 ^ Aotj, 

where cos a,*, cos /?», and covS 7 j are the direction cosines of the norm; 

* At first glance, it may appear that all surfaces are two-sided, but this is not the cae 
A simple example of one-sided surface, whose boundary is a closed curve, is given 
Sec. 6, Chap. 5. 
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i\r to S at a point of ASl Since * 


and 


cos at : cos : cos 7 ^ = 


\dx/t \dy/t 


cos^ at + cos^ + cos^ 7i == 1 


we have 


C0S7t == 


-I 

rb \^{dz/di)i~+ {dz/dy)l + 1 
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Using the positive value for cos 7 ,, which ainountKS to the choice of the posi* 
tive direction of N^ wc can write 


Act, = sec 7 i A,<4, 



+ 1 A.4.. 


The surface area of can then be approximated by the sum 




and we define the area a of S' by the integral 


<r 




sec 7 dA . 


(17-2) 


I'he integral (17-2) can be evaluated by repeated integrations to yield, 
for example, 


or 





-b 1 dy dx. 


T 3 y considering the projections R' and R" of S' on the other coordinate 
planes, we deduce similar formulas: 



To obtain the surface area of a piecewise smooth surface we need merely 
to add the areas of its smooth pieces. 

The surface integral of a continuous function (p{x,y^z) specified on the 
surface S' is defined as follows: Let S' be subdivided into subregions AS[ 


See the first footnote in this section. 
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of areas and form the sum 


{chap. 3 


Z) Ao’ti (17-3) 

im.1 

where {xi,yijZi) is some point in ASi. The limit of the sum (17*^) as n oe 
in such a way that the greatest linear dimensions of the ASl tend to zero 
is the surface integral of pix^y^z) over S'. It is denoted by the symbol 

da. (17-4) 

The integral (17-4) can be evaluated by repeated integrations. Thus, if 


«= sec 7 dA 




+ 1 dxdy. 


then ip{x^y^z) da =* //. M (ai/) 


where z «= f{x.y) is the equation of S' and R is the projection of S' on the 



dz _ —X 

dx -y/o* — X* — y* 


xy plane. 

We shall consider surface integrals 
in somewhat greater detail in Chap. 
5. 

Example 1. Find the surface of the 
sphere x* ■+- J/® -i- cut off by the 

cylinder — ox -f « 0 (Fig. 20). 

From symmetry it is clear that it will 
suffice to determine the surface in the first 
octant and multiply the result by 4. Now, 


dy -y/a* — X* — y* 


Thus, the integral becomes 


«r 



Idydx 



adydx 

«\/o* ^ X* —^ 
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It is simpler to evaluate this integral by transforming to cylindrical coordinates. The 
equation of the cylinder becomes r ■« o cos S, and that of the sphere 


Thus, 


z » — X* y* a* ~ r*. 


at 



at dr dd 



Example 2. Find the z coordinate of the center of gravity of one octant of the surface 
of the sphere x* -f- y* + Now, 


- ABE -- - 

f da 

Js* 


4ira® 

""F* 


dx dy 


a dx dy 

_ o 


PROBLEMS 

1 . Find, by the method of Sec. 17, the area of the surface of the sphere x* + 1 /* -F 

Unit lies in the first octant, 

2 . P'md the surface of the sphere x^ 4 * y* + 2 !^ o.^ cut off by the cylinder x^ — ox 4 * 

* 0 . 

3. I^nd the volume bounded by the cylinder and the sphere of Prob. 2 . 

4. Find the surface of the cylinder x^ 4- ■“ a^ cut off by the cylinder + Jr® ** o®. 

5. Find the coordinates of the center of gravity of the portion of the surface of the 
sphere cut off by the right-circular cone whose vertex is at the center of the sphere. 

6 . If a sphere is inscribc^d in a right-circular cylinder, then the surfaces of the sphere 
and the cylinder intercepted by a pair of planes perpendicular to the axis of the cylinder 
are equal in area. Prove it. 
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It is desirable to treat direcU^d quantities like force or velocity (which 
are indepcaidtait of eoordinab' sysU^ms) witfioui refc'renee to a set of eo- 
ordinate ax(\s Such a eoordinatc-fnKj treatfuent is made possible i)y th<j 
ana]yti<‘al shortluind known as vector «analysLs. 'FIk^ trajectt)ry of a 
parli(‘l<‘, the dynamics of n‘»,id bmlies, and tfie tla^ory of fluid fioAV are 
nxidily studicni In” v(‘ctor iiietfiods, ris are also su(‘h topics as tli(‘ g(‘ometry 
ot cur\es and surfaces. Introduction of coordinates yic'lds a correspond- 
])(‘t\\(*en v<‘c1ors and setvs of numb(Ts, and this (*orresf)ondenct‘ permits 
th(‘ use of vector riKdliods in the study of linear e(|uations. Such a study 
leads to the eoma'pl of fudlnx, which has proved fruitful in a variety of 
li('lds, rangin'^ from circuit analysis to qtjantuin theory. 


FUNDAMENTAL OPERATIONS 

1. Scalars, Vectors, and Equality. Some (fuantities appearing in the 
study of jiliysical phenomena can be cornplett'ly s]>eeiti(‘d by their magni¬ 
tude alotu' "riiiis, the mass of a IkkIv (‘an ))e d(‘scribed by the number 
of grams, the t(‘mp(‘rature by (l(*gn^(^s on sorin' scak', llie volume by tire 
mimlx'i of (‘ubic units, and so on. A (|uanMtv that (after a suilabk' choice 
of units) (‘an be completely ehara(‘t(Tiz(Hi by a smgk' 
miml)er is called a scalar Th(*re are also quantit i('s, 
ealkvl vectors, that recjuiro for their complete charac¬ 
terization the sjK'cification of dirc'ction as w(‘ll as mag¬ 
nitude An example of a vcH'tor quantity is the dis- 
placeijK'nt of translation of a particle. li a particle is 
displa(‘eti from a position P to a lu'w position P' (Fig 
1 ), them the change in |>ositiou can be reiirestaited 
graphically by the dina-ted line segment PI*' wliose 
length equals the amount of the (jispla(iement and wliose direction is from 
P to P'. Similarly, a forc’e of magnitude K dynes can be represented by 
a line segment whose length is K units and w^hose direction coincidcis with 
that of the force. 
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The initial point P of a directed line segment representing a vector is 
called the origiUj and the representation as an arrow su^sts that the 
tenninal point be called the head of the vector. In many problems the 
location of the origin for any given vector is immaterial, and in such 
problems two vectors are regarded as equal if they have the same length 
and the same direction. Such vectors, which need not coincide to be equal, 
are termed free vectors. In mechanics, it is sometimes convenient to 
specify vectors by giving the line of action as well as the length and di¬ 
rection. Equality of these so-called sliding vectors means that the lengths, 
directions, and lines of action coincide. Again, in the treatment of space 
curves and trajectories one is led to specify the origin of the vector 
as well as its length and direction. Such vectors are termed hound 
vectors. 

To distinguish vectors from scalars, boldface type is used for vectors in 
this book. The length (or magnitude) of the vector A is denoted by | A |: 

|A| = length of A. (1-1) 


Equality is denoted by the usual symbol: A == B. For the most part this 
chapter deals with free vectors, and hence ~ means that A and B 
have the same length and direction. 

2. Addition, Subtraction, and Multiplication by Scalars. If a particle 
is displaced from its initial position P to P', so that PP' =* A, and sub¬ 
sequently it is displaced to a position P", so that P'P" = B, then the 
displacement from the original position P to the final position P'' can be 

accomplished by the single displacement PP" == C. Thus, it is logical 
to write 



Fig. 2 


A + B - C 

as the definition of vector addition (Fig. 2). In 
words, if the initial point of the vector B is placed in 
coincidence with the terminal point of the vector A, 
then the vector C, which joins the initial point of 
A with the terminal point of B, is called the sum 
of A and B and is denoted by A + B == C. Tliis 
is the familiar parallelogram law of addition used in 
physics, and its extension to three or more vectors 


is obvious. The symbol + behaves like the + of elementary algebra, in 


that 


A -f B = B + A, commutative law 
A + (B + C) « (A + B) + C, associative law. 


(2-1) 
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A proof is implicit in Figs. 3 and 4. The associative law enables us to omit 
parentheses, writing A + B + C for A + (B + C). 




It is desirable to give meaning to expressions like 5A, the product of a 
scalar and a vector. In agreement with the meaning of multiplication 
familiar from arithmetic, one defines 

SA^A + A + A + A + A (2-2) 

(Fig 5) and similarly in other cases. By a natural extension of this reason¬ 
ing, /A is defined as a vector whose length is |/| | A| and whose direction is *hat 


A 




5A 

Fm. 5 


of A if t is positive but opposite to that of A if t is negative. One defines At 
by the equation ^ ^ ^2-3) 

It follows that lA == A and also 

s{iA) = (sf)A, associative law, 

{s + OA = sA + fA, distributive law, (2-4) 

^(A + B) == <A + fB, distributive law. 

A vector of zero length is denoted by 0 and termed the zero vector. To 
introduce the idea of subtraction, one defines —A as the solution of the 
equation A + X = 0. Evidently, —A is a vector equal in length to A 
but of opposite direction, so that — A == ( —1)A. As in elementary 
algebra, B — A is used as an abbreviation for B + (~A), 

Since the laws governing the addition of vectors and multiplication of 
vectors by scalars are identical with those met in ordinary algebra, one is 
justified in using the familiar rules of algebra to solve linear equations involv¬ 
ing vectors. 
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Example 1. The point P in 6 divides the segment AB in the ratio m:n. Express 
H in teiins of the vectors A, B and the si^lars m, n. 



(This exemplifies the so-calh'd “head-minus-tail” rule for vector subtraction.) "Jlie vec¬ 


tor AP is m/(m -f n) times X, by the hypothesis and i)y the definition of multiplication 

> 

by scalars. Since R — A -f- AP vse have, finally, 


A+-y-(B-A) 
rn 4- n 


riA + mB 

7!i 4" n 


Example 2. Prove that the medians of every triangle intersect at a jioiut two-thirds 
of the way from each vertex to the opfiosite side. 

Let two sides of th«' timngle be specifitHl by veclois A and B, as in hig 7, so that the 
third side is B - A (cf FAamplc 1). The vector median to the ^idt* B — A is ^ ^ Ihc diag- 



O B 

Fio. 7 


pnal of the parallelogram on A, B; hence this median is (A 4" B)/2 (compare the special 
case w » n of Example 1). If the point Af in Fig. 7 i.s two-thirds of the way from the 
vertex 0 to the side B — A, along this median, then 


SKC. 3] 
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2A+B A+B 

- 


'■J'he vector median to tlie side A is A/2 “ B, again by the head-ininus-tail rule If N 
is at a point two-tliirds of the way toward the side A on this median, then 




Comparison with (2-()) sliows that the two points M, N coincide. That tiio third median 
also liHS the required beliavior follows by interchanging the roles of A and B. 


PROBLEMS 

1. Sketch a vectoi A of length 1 5 in , paialiel to the lowTr edge of your paper and 
having an arrow on its iight-hand end Sketch a second vector B ot length 1 in., making 
an angle of 30'’ with A Now .sketch 2A, 3B, A 4- B, A — B, 2A — 3B, (A -f B)/2 

2. (iive a condition on three vectoi.s A, B, C wdiich ensures that they can form a tri¬ 
angle (Jeneralize to n vectors A, B, C, .. , L 

3. (haphicallv and alg('l»raicaliy, show how to find tw’o vectors A and B if their sum 
S and difftTcnce D are known 

4. Skid ('ll three vectors A, B, C issuing fioni a common point. On your figure show 
the veidors A —• C, B “ A, C — B, and thus illustrate the algebraic identity (A — C) + 
(B A) t (C ~ B) = 0 

6. iVi) While down a vtador ot unit haigth Wrhich has the same direction as a given 
noiizen* vector A (/>) ('sing tin* n^'Niilt (a), wide down a veetor tasecting tlu' angle foi med 
hy two nonzero vectois A, B issuing from a common point 

6 Show that a line fioin a veiti'x of a paialhdogram to the mid-point of a nonadjacent 
'^lde trisects a diagonal. 

3. Base Vectors. Any vector A lyin" in the plane of two noneolliuear 
\(‘('tors a and b can res()l\(‘d info so- 
vailed component dinxdeil along a and b. 

This resolution is aevoinplished hy I'on- 
stnuding th(‘ jtarallelogram whose sides are 
paralli‘1 to a and b (Fig <S). Then one <*an 

VM’ilC 

A == x'a -f ^b, 

\vlH*re X and y are the apf)ropriate scalars. 

If three noncoplanar vectors a, b, and c are given, tluai any vector V 
can be expressed unicpiely as 

V = xa + 2/b + ^c, (3-1) 

where V is t })0 diagonal of the parallelepiped whose edges are .ra, ^b, and 
zc (Fig. 9). The vectors a, b, and c are called the base vectors, and the 
scalars x, y, and z tlie measure numbers. 

An important set of baKse vectors, denoted by i, }, and k, consists of unit 
vectors directed along the positive directions of the x, y, and z axes, re- 



a jca 


F'k;. 8 
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spectively (Fig. 10). It is assumed that the system of axes is a right- 
handed system; that is, a right-hand screw directed along the positive 
z axis advmces in the positive direction when it is rotated from the positive 




X axis toward the positive y axis through the smaller (90®) angle. Because 
i, j, k are mutually orthogonal, the representation 

h ^ xi + yi + zk 
yields the important formula 

\A\^ ^x^ + y^ + 2^ (3-2) 

by use of Pythagorases theorem (Fig. 11). 



Example: If A ** 1 4- 2j 4- 3k and B « — j -h 4k, compute the length of 2A — B. 
Since 2A - B *» 2i 4“ 6j 4* 2k, we have 

12A - B| • (2* 4- 6* 4- 2*)« « 


SBC. 4] 


rONPAMEOTAL OBEEATIONS 


PROBX.BMS 

1. (a) In the form ai -f hj 4* ck write down two vectors of length 5 parallel to the p 
axis, (fe) If A - i -f 2j + 3k, B « i -f j + k, C » i -- k, compute A -f B, (A -f B) -f 
C, B + C, and A + (B -f C). What law does this illustrate^ (e) In (6), find 5A, —2A, 
the sum of these vectors, and the vector 3A. What law does this illustrate? (d) Also 
find 3A, 3B, the sum of these vectors, and the vector 3(A + B). (e) In (6), a certain 
vector D is such that A, B, D can be placed head against tail to form a triangle. What 
is the z component of D? 

2. Sketch the triangle with vertices at the heads of i +• j 4- k, 2j -f k, and 2i 4- j, 
and make the sides into vectors with head against tail. Find the vectors forming the 
sides of the triangle, and verify that the sum of these vectors is zero. 

3. Draw a figure illustrating the inequality |A-fBl < iA| 4- |B|, and by combin¬ 
ing this with (3*2), deduce an algebraic inequality. Can vou give a purely algebraic 
proof? 

4. (a) Ij<"t A, B, C, ... be vectors from the center to the vertices of a regular decagon 
(ten-sided polygon). By choosing a suitable basis i, j and using symmetry, show that 
the sum A 4 B 4- C 4- • * * is zero (h) By picking another basis i', j', with making an 
angle with A, deduce the identity cos 4" cos (0 -j- ir/5) + cos (0 4* 2t/5) 4- • • * 4- 
cos (0 4" Ojr/5) ** 0. 

4. The Dot Product. The dol product ^ of two vectors is defined to be 
the product of their lengths by the (*osine of tlie angle between them. In 
symbols, 

A-B - lAj |B| eos(A,B), (4-1) 

where (A,B) is the angle from A to B. Tims A*B is a scalar, not a vector. 
Geometrically, 

A-B == |A| X (projection of B on A) 

= |B| X (projection of A on B). (4-2) 

Evidently (A,B) can be measured in several ways. However, since cos 6 
== cos (—6) - cos (27r — 6), these different measures all yield the same 


value for A-B. Tlie fact that cos 0 cos {—6) also yields 

A-B = B-A, commutative law, (4-3) 

and one easily verifies the additional properties 

(^A)-B = f(A-B), associative law, (4-4) 

A-(B 4- C) = A-B 4- A-C, distributive law. (4-5) 

For proof of (4-5) use (4-1) to transform (4-5) into 

IAI ] B 4' CI cos V' ** IAI IB1 cos + | A | 1C | cos (4-6) 

where the angles are defined in Fig. 12. Now (4-6) follows from 

IB 4- CI cos ^ |BI cos 4- |C1 cos d (4^7) 

* The terms scalar product and inner product are often used. 
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and (4-7) is evident from Fig. 12, when the vectors are coplanar and the angles are in 
the first quadrant. In view of (4-2) the property amounts merely to the assertion that 
projections are additive, and the extension to arbitrary angles is not difficult, 

For the miitiially orthogonal unit 
vectors i, j, k introduced in Sec. 3 
we have, by inspection of (4-1), 

i-i = j*j ~ k-k = 1, 

(4-8) 

i.j = j.k - i-k - 0. 

Fui. 32 Hence, expanding the product by 

(1-4) and (4-5), we get 

(.ri 4- yi + 2 k)-(.rii -f yo + Zyk) = rxi -f ypi -f zzi, (4-9) 



By (4-1) and (4-0) the dot product gives a simple way to find the angle 
between two \'ectors and, in parti<*ular, to decide wlien two vectors are 
^perpendicular. ImlotMl, if we agree' to n'gard the zero vector as perpen¬ 
dicular to every vector, then from (4-1) 

A‘B = 0 if, and only if, A JL B. (4-10) 


The case in wliich B is parallel to A is also worthy of note, 
when B = A we have . . , . ,i» 


In particular 
(hil) 


Example. 0)mputc‘ the cobinc of the angle bdween A and B if A ** 
B as — i -j_ 2 :k, and find a value of z for which A ± B. 

We have A-B « -1 + 0 + 22 =* 22 - 1 and hence, ]>y (4-1), 


cos (A,B) 


2 2 - 1_ 22 -J_ 

fATTBl " 


The result is zero, and hence the vectors are perpetidieutar, when z = 


4- j 4- 2k, 


PROBLEMS 

1. Given A = i -f 2j -}- 3k, B = — i -F 2j -f k, C == 2i 4- j. («) Find tlic dot prod¬ 
uct of 3i -h 2j "h k with ca^’h of these vectors (Jt) Find A*B, A*C, B -f C, A‘(B 4“ C). 
What law is illuhtrabal? (r) Find 2A and (2A)*B Compare A *3 as found in (/>). (d) 
Find the angle betwetai A and B (e) Find th<* projection of A on C (/) Find a sealai ft 
such that A 4- cvB is perpendicular to A. (^) Find a vector of form ij* 4~ j// 4 k which 
is perpendicular both to A and to B 

2. (a) Show' that i 4- j + k, i ~ k, and i — 2j 4- k are mutually orthogonal. (l>) 
Choose X, y, and z so that i 4“ j 4" 2k, —i 4" zk, and 2i 4~ x j 4-yk are mutually 
orthogonal. 

3. (a) If A*B = A*C for some A 0, is it necessary (hat B » C? IlJustrat^e your 
answer by an example. (6) If A *8 * A-C for every A, is it necessary that B « C? 

6. The Cross Product. Besides the multiplication just considered there 
fs a second kind of multiplication, wdiich yields a product known as the 



FUNDAMENTAL OPERATIONS 


295 


SEC. 5] 

vector prodxwt or cross prodvcL The cross product of A and B, denoted 
by A X B, is a vector C which is normal to the plane of A and B and is 
so directed that the vectors A, B, C form a rip;h1-handed system. The 
length of C is the product of the l(‘ngth of A by the hmgtli of B by the 
sine of the smaller angle between them: 

|A X B| - |A| |B| sin (A,B). 

The expression (5-1) represents the ar(‘a of the parallelogram having A, B 
as adjacent edges (Fig. 13). The student is warned, incidentally, that 




(5-1) does not give A x B; it gives the length |A x B' only. 

Sm(‘e rotation from B to A is opposite to that from A to B, we heve 

A X B ~ —B X A, (5-2) 

so tliat the commutative law does not hold for vector products. On the 
other hand it is the case that 

(/A) X B = /(A X B), associative lav, (5--3) 

A X (B + C) - A X B + A X C, distributive law. (5-4) 

The proof of H^uatioii (5-3) is trivml, and (5—1) i.'. roadily ostublishod if we note that 
A X V IS obtaino<i from the arbitrary vector V by jierloiminf.^ the following three opera¬ 
tions 0 , illustrated in Kig 1 1 

Oil Project V on the plam* perpendicular to A to obtain a vector \ ^ ± A of mag¬ 
nitude !VI sin (A,V). 

O 2 : Multiply Vi by |A1 to obtain V 2 1 A of magnitude |A! iV[ sin (A,V). 

Os' Rotate V 2 about A through tM)'' to obtain V.^ = A X V. 

It IS ('asily check(*d that each of these operators is distributive; that is, Ot(B -f C) ^ 
OtB -f OjC for all vector.s B ainl C Hence the composite opiTator Ch020i is distributive; 
namely, 

03020 i(B + C) = 0 .i 02 ( 0 iB -f f>iC), since 0i is distributive, 

« + O 2 O 1 C), since O 2 is distributive, 

" O 3 O 2 O 1 B -f OJO 2 O 1 C, since Os is dislTibutive. 

Be(*ause O/hOiV ^ A X V for (‘very vendor V, the latter equation yields (5-4). 
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The definitioQs of vector product and of i, j, k lead to 

ixi=.jxj»=kxk*0, ixj>=—Jxi*k, 

J X k »= —k Nj = i, kxi«=—ixk*j. (5-5) 

If A and B are given by their components as 

A = zi + 3/j -H zk, B = zji + yij -f- Zjk 

then expansion by means of (5-3) and (5-4) and simplification by means of 
(5-5) yield 

A X B = i{yz) - zyi) + j(xiz - xzi) + k(xyi - yxi) 
which may be written as a determinant * 


A X B 


i j k 



a= i 

y z 


X z 

+ k 

X y 

X y z 

- j 




Vi zi 


Xi Zi 


2*1 2/1 


xi yi zi 


(5-0) 


Example. Find a vect-or perpendicular to i + 2k and i + j — k, and find the area of 
the triangle with these two vectors as adjacent sides. 

Both questions are settled by calculating the cross product. We have, from (5-6), 


(i -h 2k) X (i + j ~ k) 


i j k 
1 0 2 
1 1 -1 





0 

1 


« -2i -f 3j + k. 


This vector is perpendicular to the given vectors The area of the triangle is half the 
area of the parallelogram: 

Area — 2i-h3j4'k| 34\/l4. 


PROBLEMS 

1, Given A « i 4- 2j -f k, B ■* 3i -f 2j, C - -i + j + 3k. (a) Find A x B, A x C, 
AxB+AxC, B-fC, and A X (B 4* C). Wliat law is illustrated? (6) Find a vf*ctor 
perpendicular to B and C, and verify your answer by use of the dot product, (c) If A, 
B, C have their origins at a common {loint, find a vector perpendicular to the plane in 
which their heads lie. (d) Find the area of the triangle formed by the heads in (c). 

2. Show that the cross product for each two of the following vectors is parallel to 
the third: i 4" j 4“ k, i — k, i — 2j 4- k. What does this indicate about the vectors? 

^ ' The reader unfamiliar with second- or third-onler dcd/erminants is referred to Ap¬ 
pendix A. 
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8« Give an example of three unequal vectors such that the cross product of any two is 
j)erpendicular to the third. 

4. If A X B « 0 and A*B « 0, is it necessary that A « 0 or B « 0? 

6. In refraction at the plane interface of two homogeneous media let A» B, C be unit 
vectors, respectively along the incident, reflected, and refracted rays, and let N be the 
unit normal to the interface, (a) Show that the law of reflection is equivalent to A X If 
*» B X N. (6) Show that the law of refraction is equivalent to njA X N «■ n 2 C X N, 
where ni and n 2 are the indices of refraction. 

6. Continued Products. With the two multiplications previously de¬ 
fined, we can form the products (A*B)C, A'{B x C) and A x (B x C); 
some of the other possible combina¬ 
tions, however, have no meaning. 

For example, (Af^B) x C is mean¬ 
ingless because the two factors in a 
cross product must both be vectors. 

The first product, (A*B)C, denotes 
simply the pr(xluct of the scalar 
A«B with the vector C and may be 
dismissed without further comment. Fia. 15 

By definition of dot product, the 

second expression, A* (B x C), called the scalar triple product, has the value 

A-(B X C) « |A|cosl9|B X C|, (6-1) 

where B is the angle between A and B x C. Since B x C is perpendicular 
to the face of the parallelepiped containing B and C (Fig. 15), and since 
|B X Cl is the area of this face, (6-1) shows that A• (B x C) represents the 
signed volume of the parallelepiped having A, B, C as adjacent edges. More¬ 
over, we have the formula 

Ax A y A. f A ~ lAx “b ^Ay -f* 

A-(BxC)» Bx By B, , B = iBx + jBy + kB,, (6-2) 

Cx Cy Cg .C iCx + jCy + kCx, 

as will now be seen. The expression (5-0) yields 

i j k 

B x C = Bx By Bg = iP + jQ + kP, (6^) 

Cx Cy Cg 

say, where P, Q, R are certain second-order determinants. Taking the 
dot product of L4x + + kA, with (6-3) leads to 

A*(B X C) *= AxP + AyQ + A,P, 

which is the expansion of the determinant (6-2) on elements of the first row. 
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Since interchanging two row^s of a determinant merely changes its sign, 
(6-2) yields the useful relations 

A^(B X C) « B-(C X A) == C*(A x B) 

- -B-(A X C) - -A-(C X B) = -~C-(B x A). (6-4) 

These results as to magnitude are evident from the volume interpretation, 
though further discussion is needed to estahlisli the algebraic sign in this 
way. Because of (6-4) it is customary to write 

A-(B X C) = A-B x C - (ABC). (6-5) 

To evaluate the vector tnpic 'prodvet A x (B x C), let i be a unit vector 
parallel to B and j a unit vector perpendicular to i in the plane of B and C. 
Thus 

B == Bjri, C = Cx^ + A = + -'lyi + ^zk, (6-6) 

where k is a unit vector pierpendicular to i aiul j. so oriented that the three 
form a right-handed system. Since B x C = BxCyk by (6-6) and (5-6), 
we have 


A x (B X C) - -AxBxC\} + AyBxCyi 

- (AxC. + A,Cy)Bxi - AxBxiCxi + C,j) 

- B(A-C) - C(A.B). (6-7) 

Example' Establish the identity 

1 AC BC 1 
(A>CB).(CXD) = |^^ B.d|- 

The expression is the scalar triple* pioduct of A x B, C, and D. InUTchanging the dot 
and cross, as we may by ((i-4), w'c obtain 

(A X B)*C X D « (A X B) X C'D - }(A-C)B - (B*C)AJ*D 

* (A-C)(B-Dj - (B-Cj(A-D), (6-9) 

since (A X B) x C « - C X (A x B) - (A«C) B - (B-C) A by (6-7). 

PROBLEMS 

1. Verify (6-2), (6-7), and (0-8) f)y direct calculation for the sfiecial case A *= i 4* j, 
B - ~i + 2k, C « j -f 2k, D - i -h j + k. 

2. (a) In Prob. 1 find the volume of the parallelefiiped having A, B, and C as adjacent 
edges. (6) Find x such that the vectors 2i -f j — 2k, i -f- j 4“ 3k, and ri 4* j are coplanar. 
H%nt. A certain parallelepiped must have zero volume, (r) State a simple necessary 
and sufficient condition that three arbitrary vectors A, B, C be coplanar. (d) Evaluate 
(AAB) and (ABA), where A, B are arbitrary. 

3. By (6-7) show that A x (B X C) 4- B x (C X A) 4“ C X (A X B) « 0. 

4. Show that (B X C) X (C X A) « C(ABC), and deduce 

(A X B)-(B X C) X (C X A) * (ABC)*. 
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5. The vectors A, B, C issue from a common iK)int and have their heads in a plane. 
Sliow that (A X B) + (B X C) -f (C X A) is perpendicular to this plane. 

7. Differentiation. If for each value of a scalar t a vector R(/) is defined, 
we say that R is a vector function of L In a particular problem t may 
denote the time and R the position vector of a moving point relative to 
some origin (). As in the calculus of scalars, we say that R(0 is a contin uous 
vector fimction of ^ at ^ provided that 

lim R(/) = Rao). (7-1) 

t -4 

The precise meaning of (7-1) is that 1R(0 - R(/o) | becomes as small as 
desired whenever t is sufficiently near /,). 

Tlie c{irt(‘sian components of the vector R(t) are functions of t, so that 
one may WTite 

R(/) - ix(t) + j^(0 + kz(i), (7-2) 

It follows from (7-1) tliat the functions x(t), y(i'), z(t) are continuous if, 
and only if, R(/) is continuous. 

We define the deru alive of R(0 with respect to t by the formula 

(TR R(^ + AO -- R(0 

== hm- (7-3) 

dt At -* 0 At 


The substitution of (7-2) in the definition (7-3) leads immediately to the 
result that R is difTerentiable if, and only if, t, y, z are, and in that case 


c/R 

Yt 


dx du di 
1 r + + 


di 


dl 


dt 


(7-4) 


As in scalar calculus w'e shall write R'(0 for dR/dtj R"(0 for and 

so on. 

Products involving vectors are differentiated by the familiar rules of 
elementary calculus, and the proof of these rules also involves only familiar 
ideas. For example, the formula 


dt 


dB dA 

(A X B) - A X — X B 

^ dt dt 


(7-5) 


follows from 


A(A X B) = (A + AA) X (B + AB) - a X B 
= AxAB + AAxB-PAAxAB 

w^hen we divide by At and let At 0. Of course, the order of the factors 
in (7-5) must be preserved, since the cross product is not commutative. 
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A geometric interpretation of the derivative may be obtained as follows: 
Let the vector R(<) be regarded as a hound vector with its origin at the 
origin of coordinates, The head of R then traces out a space curve as i 
varies (see Fig. 16). The vector 

AR = R(i + AO - R(0 (7*6) 



is directed along a secant of the curve, AR/A^ is parallel to this secant, 
and hence lim (AR/AO is tangent. Thm^ the vector R'(0 is tangent to the 
Space curve R = R(0 whenever R'(0 exists and R'(0 9^ 0. 

To interpret the magnitude jR'(0|, let s be the length of the curve 
from the fixed point given by ^ *= to the variable point given by L 
Assuming R'(0 5*^ 0, we have AR 9 ^ 0 for small A^ > 0, and hence 


As As |AR| As 

A«*~lARl At \AR\ 

Since |AR| is the length of the chord, and since the ratio As/|AR1 of 
arc to chord ^ tends to 1, Eq. (7-7) gives 


AR 

At 


(7*7) 


ds dR 
dt dt 


(7-8) 


when At —» 0. Thus, the vector R'(0 has magnitude |R'| « ds/dt where s 

^ We assume that s increases with t; otherwise a minus sign is needed. The fact tliat 
^arc)/(chord) —► 1 follows from the familiar interpretation of arc as limit of lengths of 
insoribed polygons. It is also possible to take (arc)/(chord) 1 as one of the defining 
properties of arc and proceed, as in the text, to obtain the formula (7*-9). 



nmUAMENTAL OPESATIONS 


BBC. 7J 


301 


ia the arc lengOi along the curve. If R'(<) is continuous, the arc is 
explicitly by 


s - £ |R'(01 * “ £ dt. 


given 

(7-9) 


Introduction of s as parameter instead of t facilitates the study of space 
curves (see Sec. 10). 


In two dimensions the interpretation of R'(/) given here agrees with the resuJta of 
elementary calculus. Let a smooth curve C be represented parametrically by a; « jc(0, 
V " 2/(0, so that the slope is given by 


Slope = - 
dx 


dy/dt 

dx/dt 


t 

x' 


(7-10) 


for x' 7 ^ 0. If the same curve is dt^scribed in tlie form R « Lc + jy, we have R' hr' -f- 
and hence the slope of the vector R' is y'/x' la view of (7-10), the fad that W is 
tangent to the curve agrees with the fact that dy/dx u ike slope of the curve. The formula 
ds/di « |R'l is also familiar; it states that 

which liecoraes « dx^ + when squared and multiplied by (d/)®. 

Physically, one may regard t as time, so that the head of the bound 
vector R(/) gives the position of a moving particle at time L Since the 
velocity is defined to be V = R'(0f foregoing result means that the 
velocity vector is tangent to the trajectory and has magnitude equal to the speed 
ds/dt with which the particle is moving. 


Example 1. The position of a particle at time t is determined by the bound vector 
R(0 « if -f- -f- k sin t. 

Find a vector tangent to the orbit at time and find the speed of the particle at time 

f « 0. 

We have R'(0 *« i + 3jf® 4* k cos t, which is the required tangent vector. At f » 0 
the velocity is R'(0) * i + k, and hence the speed is ds/dt = |R'(0) 1 *= \/2- 
Example 2. If a differentiable vector R(f) has constant length, show that R' is per¬ 
pendicular to R, and interpret geometrically. 

From R*R » const, differentiation yields R*R' -f R^*R =“ 0, whence R'*R »» 0. 
Geometrically, if R is a bound vector of constant length, its head traces out a curve lying 
on a sphere. The tangent to the curve is tangent to the sphere, hence perpendicular 
to the radius vector. Thus, R' X R. 


PROBLEMS 

1. If R(0 « i2( + jSf® + kf®, (a) find the derivative R'(0. (6) At the point (2,3,1) 
find a tangent to the space curve which is traced out by the head of R when R is regarded 
as a bound vector. Hint: The point (2,3,1) corresponds to f «* L (c) If R{<) is a bound 
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vector giving the poeition of a moving particle at time find the velocity and speed of 
this particle at time I »» 1, 

2. (a) Differentiate the vt^clor R(0 » i/ + j sin ^ -f k cos t, compute |ll'(01, ^ind sim¬ 
plify. W R{0 is a bound vector, find the length of the curve tracked out< by the head 
of It as < varies from ( «« 0 to ( « 2. 

3. By writing A*B in component form and differentiating, deduce (A*8)' =» A'‘B •+• 
A*B'. 

4. If Be and A are constant, find a vector tangent to the curve described by the bound 
vector R « Ro + A/. 

3. If R(/) is a l>ound vector giving the position of a moving particle at time t, the 
acceleration is defined to Iwi A »= R"(0. Show that A in constant if R(0 =» Ro + Ri< -f 
Raf'**, where Ro, Ri, and Rj are constant vectors. Is the conv(‘rse true? 

6. Show that (ABC)' « (A'BC) -f (AB'C) -f (ABC '), when A, B, C are difft'rcntiable, 
and write out in determiniint form. 

7. If R — A -f/(OB, where A and B are constant and / is twice diff(irentiabie, then 

R' X R" « 0. 

APPLICATIONS 

8. Mechanics and Dynamics. The work W <lone by a constant force F 
producing a displacement S in the direction of F is |F| |Sj. ]\lor(‘ gener¬ 
ally, if F makes an angle 0 with S, the work is iFj |S| cos 0, and hence* 

ir - F-S. (8-J) 

Because of thivs etjuatiori the dot 
producjt plays a central role in cer¬ 
tain firanches of mechanics. 

To illustrate the a[)plieation of 
(TOSS products, let tlie vector ref>- 
rcsent the angular velocity of a 
rotating body; that is, lei 0 be a 
vector wdiose magniUidti is Die angu¬ 
lar speed in radians per second and 
whose direction is paralk*! to the axis 
of rotation. The positive sense of 
n is (‘boson as that in w'hi(*h a right- 
hand(3d screw would advance if the 
screw w'(Te rotaUid in the same di- 
O rection as the body. Let R be a vec- 

FiG. 17 tor locating any point P of the body 

relative to some point 0 on the axis 
of rotation. It is required to find the instantaneous velocity V of tlie 
point P If the distance of P from the axis of rotation is a, then by Fig. 17 

1V| = |01a - ia| iRIsin (R,a). 

Moreover, V is normal to the plane of R and a and is so directed that 
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Q, R, and V form a right-handed system. Hence, 

V = a X R. (8.2) 

The result is indopeiident of the origin O, for if a new origin Oi is chosen and P is 
specified b}" a vector Ri from Oi, then 

Ri = R 4- S, 

where S is parallel to H (see Fig. 17). Hence H x S = O, and therefore 

nxRi«nx(R-hS)-nxR + ftxs-nxR. 

Another example from dynamics illustrates the (compactness of vector 
notation. Let 0 be a fixixi point in a rigid body, and let a force F be applied 
at a point R of the body, which is located by tiie bound vector R wdiose 
origin is at 0. The for(;e F establisln's a torque or moment T wdiich tends 
to rotat.e the body about an axis that passes through O and is normal to 
th(‘ plane of R and F. ITie magnitude of T is given by 

|T| |R! |Flsin(R,F). 

In addition, R, F, and T form a right-handed system, so that 

T =• R X F. (8-3) 

That the choic'o of 0 is immaterial follows as in the discussion of (8-2). 
Similarly one shows that F may slide along its line of action without 
aff(‘(jting the result; that is, F may be regarded as a sliding vector. 

To illustrate the use of (K-3), we obtain a formula for the so-called center of mass of a 
system of ma!8.s points. The force on a point of mass m in a gravitational field is given 
by mF, when' w is the mass of the point and F is a vector specifying the strength of the 
field at the point m (luestion. W<‘ assume a uniform field, so that F is independent of 
jiosition. From (8-3) 

(R - P) X mF (8-4) 

represents the moment about the point ’ P of the gravitational force on a point of mass 
m at R. If there arc n points of inaascs mi, m 2 , . -iocaUid by the vectors Ri, R 2 , 
..., Rn, respectively, the total momtmt about the point P due to all of them is 

- P) X m»F (8-5) 

It is desired to find a single mass point such that its moment (8-4) reproduces the 
total moment (8-5) for all choices of F and P. Equating the moments (8-4) and (8-5) 
leads to 

[mP - SmJP - mR -f X F « 0, (8-6) 

after rearrangement. Since F is arbitrary in (8-6), the factor in brackets must vanish, 
so that 

P(m — SmO *= mR — Sm,-Ri. (8-7) 

»The vectors R, P, and R, are bound position vectors with a common origin for the 
points /?, Py and Ri, respectively. 
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Tiie fact that P k arWtrary in (8-7) now gives 

m »Zmit SmtRf. (8-8) 

Conversely, (8-8) ensures the validity of (8-7) and hence of (8-6) independently of F and P. 

This discussion was carried out by equating moments only. Equation (8-8) shows, 
however, that the total gravitational force is also preserved, since the mass of the point 
equals the total mass of the collection. 

The point R with position vector 

rriiRi + W2R2 + WnRn 

R =“ Y -p Y (8-9) 

determined by (8-8) is called the center of mass. Evidently the collection 
of points, regarded as a rigid body, wpuld balance about the point R as 
pivot, for the moment (8-4) is zero when P == R, and hence the moment 
(8-5) also vanishes. 

Still another example of the use of vectors in mechanics is given by 
Newton^s laws. Relative to an origin 0, which is regarded as fixed, let 
the position of a particle at time t be specified by the bound vector R(0. 
The velocity vector V is dR/dt, as indicated in Sec. 7, and the momentum 
vector is defined by 

dR 

M « mV = m—» (8-10) 

dt 


where m is the mass of the particle at time t. In this notation Newton^s 
second law of motion takes the simple form 


F « 


dM 

dt ' 


(8-11) 


where F is the force on the particle at time t. 


F - 



If m is constant the result is 
(8-12) 


We shall use (8-10) and (8-12) to derive some interesting properties of the center of 
mass. Suppose given n particles with masses m, and positions denoted by R* (i *■ 1, 
2, ..n), where each rm is independent of L The total momentum of the system 


satisfies 


Xnti 


dRi d 
dt dt 


£m«Ri 


d ZiriiRi 
dt m 



(8-13) 


where m « Sm* is the total mass and where R locates the center of mass [(8-9)]. Thus, 
the total momentum of the system equals that of a single partide which has mass m and moves 
wUh the same velocity as the center of mass of the system. 

If (8-13) is differentiated with respect to there results 


SF< 


m 


d*R 

dt^ 


When we let F* be the force on the tth particle and use (8-12). Since internal forces can¬ 
cel in pairs by Newton's law of equal and opposite reaction, the sum 2F| represents the 
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total exUmal foroe aeting on tiie aystem. Hence the cmler of mau hcu A« tame acoOmt- 
Hon at a particle of matt m acted on by a force equal to the turn of Oie external foreet aeting 
on the system. 

Example 1. Parallel farces F, —F of equal magnitude but opposite direction constitute 
a couple. Find the total moment, and show that it is the same about every point 
Let R be a vector from a given point O to a point P on the line of action of F, and Ri 
to a point Pi on the line of action of ~F (Fig, 18). The total torque is 


R X F + Ri K (~F) 


(R ~ Ri) X F « (PiP) X F. 


Since this is independent of O, the result 
follows. Notice that F and — F must be 
regarded as sliding vectors (Sec. 1) rather 
than free vectors, since the line of action 
is fixed. 

Example 2. A system of forces F^ acting 
at various points Rt of a rigid body is such 
that 2F, 0. If the total torque about 

one point is zero, then the total torque 
about every point is zero. 

From ]S(Ro — Ri) x F* ** 0, say, we arc 
to deduce 2(R — R») x F» =« 0. The two 
equations may be written 

Ro X (ZFO 

R X {2F,) 



Fig. 18 

2:r» X F*, (8-14) 

2R, X Fi. (8-16) 


Equation (8-14) gives SR* x F, «= 0, since SF* « 0, and (8-15) follows. 

Example 3. The moment of the momentum vector M about a point is called the 
angular momentum of the particle about that point. According to the principle of angu¬ 
lar momentum^ the rate of increase of angular momentum about a point equals the re¬ 
sultant torque about that point. Show that this principle is equivalent to Newton’s 
law, F « dUL/dt. 

If A is the angular momentum about the oi igin, then A « R X M, where R gives the 
position of the point. Thus 


Tt 


R X 

R X 


dM 

dt 

dM 

di 



X M 


H- V X (mV) 


R X 


dM 

dt 


The principle of angular momentum dk/di 


R 


dM 

dt 


R X F is therefore equivalent to 
R X F. (8-16) 


If this holds for every choice of origin, that is, for every R, then necessarily dVL/dt ■» F. 
Conversely, if dM/d< F, then (8-16) holds for every R. 


PROBLEMS 

1. Given A •» i 4- 2j k, B - i - k, C « 2i -f- j, with A, B having their origins at 
a common point, (a) find the work done by a force A in a displacement B. (6) Find 
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the work done in a displacement from the head of A to the head of B under a force C. 
(c) Find the work done in the displacement A subject to simultaneous forces B and C. 

2. In Prob, 1: (a) Find the torque about the origin of A due to a force C through the 
head of A. (6) Find the torque about the head of A due to a force C acting through the 
head of B. 

8. Tn Prob. 1: (a) If the figure formed by A and B rotahis about A with angular veloc¬ 
ity ft, find the velocity of the head of B. (h) Find the velocity of the head of A if tiie 
figure formed by A and B rotates with angular velocity ft about an axis parallel to C 
through the head of B. 

4. Two coordinate systems have a common origin at all tim(*8, but the second has a 
vectorial angular velocity ft relative to the first. Show that Vi V 2 -f (ft X R), where 
Vi and V 2 are the velocity vectors in the first and second systems of a point whose posi¬ 
tion vector is R in the first system, 

6. Show that the torque due to two couples is the sum of the torques. 

8, Three points lai^eM 1, 2, 3 have masses 1, 2, 3 and positions 2i -f j + 2k, i — k, 3j, 
respectively, (o) Find the center of mass. (6) Find the total mass 2, 1, and their cen¬ 
ter of mass. From this obtain, again, the cenf(*r of mass for all three. 

7. The vectors A, B, C, P, E give the positions of the vertices of a regular pentagon 
as referred to an origin not necessarily in its plane. Show that their resultant is equal 
to 5R, where R gives the position of the center IhrU‘ Place a unit mass at each ver¬ 
tex, and find the center of mass in two ways. 

8* (a) Show that F*V represents the rate at which work is done on a particle moving 
with velocity V under a force F. (b) Whon the mass is (‘onstant, show that 

(d/dO(w|V|V2) - F-V, 

80 that the rate of increase of kinetic energy equals the rate at which work is done on 
the particle. 

9. Lines and Planes. If R is a bound vector with its origin at the origin 
of coordinates, then the direction numbers x, t/, z are the same as 
coordinates of the head of R, and one may speak indifferently of “the 
point R * or “the vector R.^' This corrt'spondence between vectors and 
points enables us to use vectors in geometry. Here we consider the g(‘- 
ometry of lines and planes, whicli is esf)ocially simple; tlu^ following si^ctions 
are concerned witli general curves and surfac(is. 

Suppose we have given a plane through the point Ro and perpendi(‘ular 
to the constant vector A, If the point R is in the plane, then R ~ Ro 
is perpendicular to A, and conversely (Fig. 19). Hence the eejuation of 
the plane is 

(R~Ro)-A-0. (9H) 


If D is the distance from the point Ri to the piano, then 


D = 1 Ri — Ro 11 cos <91 


|Ri -~Ro||A||cosg| ^ KRi_~ Ro)_^A| 

|A| ■ ” TaI. 


(9-2) 


where & is the angle between A and Ri — Ro. 

Next, consider a line through the point Ro and parallel to a constant 
vector A. If the point R is on this line, then the vector R — Ro is parallel 
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to A, and conversely (Fig. 20). Hence, the equation of the line is 


(R - R(0 X A = 0. (9-3) 

If D is the perpendicular distance from the point Ri to this line, then 



Fxfj 19 Fig 20 


Tn (9~3) the fact that R — Ro is parallel to A may also be expressed 
by writing 

R - Ro - kt, 

w'here t is a scalar. Thus we obtain the equation of the straight line in a 
parametric form, 

R - Ro -f A^ -00 < / < CO, (9-5) 

which is often more useful than (9-3). It is lett to lh(‘ student to deduce 
the cartesian eiiuation t).y setting 

Ro == aoi + voj + -ok, A =- ai + + ck 

in (9-5) and equating componcuits. Eliminating t yields the symmetric 
form 

^ - inf®, 

a h c 

which may also be found from (9-3). 

Example 1 . Show that every equation of form 

ox -f 62 / + r 2 -f d » 0 a, b, r, d const 
represents a plane with A oi -f* T as normal, and conversely. 


(9-7) 
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H E is A general point and Eo a fixed point on the locus (9*7), then writing (9*7) in 
vector form yields 

E.A4*d«0, Eo-A-fd«0. 

Subtracting these equations we obtain (9-1), which shows that the locus is a plane. On 
the other hand, (9-1) itself has the form (9-7), with d « — Ro*A, and hence the converse 
is also true. 

Example 2. Find the equation of a line which passes through the point i — J and is 
parallel to the two planes x -f y "* 3, 2x -I- y -f Sz «» 4. 

Ihe respective normals to the planes are i 4- j and 2 i 4 * j 4 * 3k, and hence the line 
of intersection of the planes is parallel to the cross product: 

(i 4- j) X (2i + i 4- 3k) * 3i - 3j - k. 

Since this vector is parallel to both planes, it gives the direction of the required line, 
and hence the equation is 

E - i - j 4 - (3i - 3j - k)t, ~oo < f < «. 

PEOBLEMS 

1. (o) Find a vector normal to the plane x 4 - 2v 4 * 3i « 1 . (5) Find the angle be¬ 

tween this plane and the plane x 4 - y 4 “ « 4 - 2 * 0. (c) What is the distance from the 
point 3i 4* 2j 4“ k to the plane in (a)? (d) Show that the points i and —j 4 - k lie in 
the plane in (a), (e) Find a vector lying m the plane in (o). fftni- Subtract the vectors 

of (d). (/) Verify that the vector of (e) is normal to the normal found in (a). 

2. (a) Find a vector parallel to the line R«»i4*k4(i4 2j4' 3k)t (/>) Jf R »• ix 4 * 

jy 4 - k« in (a), find x, y, and z in terms of t (c) In (o), find the distance from the point 
i 4 - 2j 4 - 3 k to the given line, (d) Show that the line (a) intersects the line R « 2k 4- 
(Si 4- 2j 4“ k)«. ffirU' Equate the two expressions for R, and consider each component. 
It will be found that all three equations are satisfied by s =« t =» (e) Find the inter¬ 

section point in (d). (/) Find the point where the line in (a) intersects the plane 2x — 
y -|- 32 « 4. Hint: Substitute the result of (l>) into the equation of the plane, find (, 
then find R. 

3 . (a) Find the equation of the line common to the two pianos x -f 2 y 4 * 42 «■ 1 , 

a? 4 - y 3 in the form R « Ro 4* A£. ffml: Let 2 * t, and solve for x and y in terms 
of (, (h) Find a vector parallel to the intersection of the planes by use of the cross prod¬ 
uct as in Example 2. (c) Verify that your answers to (a) and (b) are consistent, (d) 

Find the equation of all planes perpendicular to both planes, (e) Write the equation 
of the line which is parallel to both planes and passes through the point -~3i + k. 

4. (a) In terms of t, find the square of the distance from the point i -f- 2j 4 - 3k to a 
general point on the line R«*3i4'2j4-k4-(i4-j4‘ k)t. (h) By differentiating, find 
the t for which the distance is minimum and the minimum value, (c) Check by the 
distance formula. 

fi. In the form (9-5) obtain the equation of a line perjiendicular to the plane x -h 1 / 4- 
32 ao 0 at the origin. At what point does this line intersect the plane y * 82 + 1? 

6 . If the lines R ■■ Ro 4- Af and R « Ri 4 - are not parallel, then the perpendicular 
distance between them is 

|(Ri-Ro) • A X B| 

lAxBI 

Bini: By a suitable figure show that the distance is the length of the projection of 
El ** Eo on the common perpendicular to the two lines. 
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10, Nomml Lines and Tangent Planes. If a curve C:x m x(t), y « y(<), 
z » z{t) lies on a surface which has the equation 

u{x,y,z) « c, (10-1) 

where c is constant, then 

'*A^{t),y{t),z{t)] E c (10-2) 


identically in t At a fixed point Ro = ixo + jyo + k^o (Fig. 21) we dif¬ 
ferentiate (10-2) by the chain rule (Chap. 3, Sec. 4) to obtain 


du dx du dy du dz 
dx dt dy dt dz dt 


(10-3) 



This may be written as 

n-R'(0 - 0, (10-4) 

where R(0 = ix(0 + iy{t) + k2(0 and where 
du du du 

n == i-h j-h k — at (ro,2/o,2o)* (10-5) 

dx dy dz 

Since R'(/) is tangent to the curve C ly Sec. 7, it follows from (10-4) 
that n is normal to the curve C. And since this is true for every choice of C, 
the vector n must be normal to the surface. The tangent plane is the 
plane perpendicular to n at Ro, and hence its equation is n* (R — Rq) = 0 
by Sec. 9. 

The assumptions which underlie the foregoing result are clear from the derivation. 
We assume u differentiable (so that the chain rule holds), and we assume that not all the 
partial derivatives are zero (otherwise n « 0, and n does not determine a direction). 
The analysis shows, then, that n is perpendicular to every differentiable curve R «• R(0 
which passes through the point Ro and lies in the surface. It is this property that enables 
us to consider n as “normal to the surface.” 
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To illustrate the use of (10-5) we find a normal vector and tangent 
plane for the ellipsoid ^ ^ 2/ + 3^^ == 12 

at the point (1,2,-1) Since ?^ == + 2\/ + 3z^ the partial derivatives 

are 2x, 4?/, and 62 : Ev^aliiating these at ( 1 , 2 ,— 1) and substituting in 
( 10 - 5 ) give the normal vector 

n = 2i + 8j — ()k 

The tangent plane is perpendicular to n and contains the point (1,2,- 1) 
Hence its equation is 

X + 4ty — Zz — 12, 

as the reader can verify 

Introduction of the tangent plane leads to a himplo interpretation of the differential 
(Cliap 3, Sec 3) If the equation of a surface is given in the form z « f{x,y), then 

fix,y) - 2 * 0 

and hence (10-1) holds with u(x,y z) « f{x,y) — z By (10-5) a normal is 

n = i + j - k (KM)) 

dx dy 

SO that the tangent plane has the equation * 

(x - Jo) ^ -f* (v - l/o) “ = « — 20 , (10-7) 

Ox dy 

where df/dx and €^f/dy are evaluated at (Jo,Vo) II set x ~ xo = Ax, v Vo =® Av, 
and 2 — 20 “• in (10-7) (Fig 22), there results 

df df 

Ax -1- Ay = Az 

dx du 



Fig 22 

' The values x, y, z in (10-7) refer to the tangent plane and must not be confused with 
the values x, y, 2 on the surface 2 » /(j,v) 
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The left-hand side is simply the differential d/, and hence the differential for the surface 
^ * /(^,2/) equals the increment for the tangent plane. The definition of differentiability 
given in Chap. 3, Sec. 3, now lias a simple intuitive meaning; namely, f{x,y) ta differen¬ 
tiable tf, mul only if, the surface z »» f{x,y) xs well approximated by its tangent plane. 


PROBLEMS 


1. By use of (10-5) find a vector normal lo the plane Jt:r -f 5?/ 4- cz + d « 0. Com¬ 
pare Sec. 9, Example I. 

2. At the point (2,1,3) on the burfa<*e xi^z ~ jc® -f 2 find (a) a normal vector, (6) an 
equation for tlie tangent plane, (r) an equation for the iionnal line. 

3. Show that the surfaces ji/z •= 1 and -| 0 inters(‘ct at right angles at 

the point (1,1,1); that is, the tangent planes are perpendicular 

4. The tno surfaces 4- fi and 2x^ -t 3^" 4 2“ ~ 9 intersect at (1,1,2). 

Find the angle between the tangent planes .it this point 

6. In Prob 4 find a vector tangent to th<‘ curve m which the surfaces intersect. Hint' 
The rcfiuircd vector is perpruidicular to both normals 


11. Frenet’s Formulas. It was hliowii in S(‘(‘. 7 that the veetor R'(/) 
is tangenl to the sparse curve R ~ R(/) ami has leng:th |R'| — ds/dtj 
where s is the arc along; the curve If (he parameter itself is equal to the 
arc, so that t — s and 


then ds/(H = 1. 

T 


R R(^‘), 

In this ease the vector 
(JR 
(is 


( 11 - 1 ) 


(il-2) 


is a tangent vector of inn I length. 

From T*T ~ 1 w’e deduce that 
(IT/(is is perpendicular to T (Sec. 7, 

Example 2). H(‘n(‘e we may write 

(IT 

~ - xN, x>(), (11-3) 

(Is 

wliere N is a unit vector perpendi¬ 
cular to T and where x is a scalar 
multiplier. The veidor N defined 
by (11-3) i.s ealled the yrirmpal nor- 
mah and the scalar x is ealh‘d the 
curvature. The plaiu* of T and N 

is termed the osrulating plane. We define x =• 0 for a straight line. 

If we introduce a third unit vector B defined by B -- T x N, then the 
system T, N, B forms a right-handed set of orthogonal unit vectors, anakv 
gous to the vectors i, j, k introduced previously. By Fig. 23, 

NxB-T, BxT-N, TxN-B. 



/c 


(11-4) 
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Tbe vector B is called the Mnormal; the figure formed by T, B, N is some¬ 
times referred to as the trihedral associated with the curve. 

Differentiating the relation B « T x N and using (11-3) give 

B' « T X N' + T' X N - T X N' + (xTH) x N - T x N', 

and hence B' is perpendicular to T. It is also perpendicular to B, since 
B*B « 1, and therefore B' is parallel to N: 

dB 

~ - tN. (11-5) 

as 

The scalar multiple r in (11-5) is called the torsion; it measures the rate 
at which the curve twists out of its osculating plane. We define r = 0 for 
a straight line. 

To evaluate dN/ds, recall that N = B x T. Hence 

N' = B X T' + X T - xB X N + tN X T (11-6) 
by (11-3) and (11-5). When we use (11-4), Eq. (11-6) reduces to 

dN 

— =-«T-rB. (11-7) 

ds 

Equations (11-3), (11-5), and (11-7) are known as the Frenet-Serret for¬ 
mulas; they are of fundamental importance in the theory of space curves. 

By equating the lengths of the two vectors in (11-3) and recalling jN | 1 we obtain 

( 11 . 8 ) 

To get a similar formula for t we diiforentiato (11-3), obtaining 

T" « xff' -h x'N « h(-xT - tB) + (11-9) 

by (11-7). Hence, by (11-3), (11-9), and (11-4), 

T' X T" ^ xfi X - htB -b x'N) « h^B - AT, 


eino© N X N « 0. Taking the dot product with T yields 

T-r X T" -A. (IMO) 

If we solve (11-10) for t, express x* in terms of R by (11-8), and express T in terms of 
R by (11-2), there results 

R''R" X R'" 


which is the desired formula. When R 
give, respectively, 


ix(s) + iyis) -b lLjf(s), Eqs. (11-8) and (11-11) 



+ («")* + (*")*. 


( 11 - 12 ) 
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It can be shown that x(s) and r(«) determine the curve completely, apart from its position 
in space.^ 


Since a smooth curve can always be expressed in terms of its arc as 
parameter, the foregoing theory suffers no loss of generality by assuming 
t ^ 8 . In many physical problems, however, it is more fruitful to take 
the time t as parameter, and this possibility is now to be examined. 

Let R = R(/) give the position of a moving particle at time so that 
the velocity is V = R'(0. With v = ds/dt we have ^ 


dR dR ds 

V - ~ --= Tr, 

dt fh dt 


(11-13) 


upon using (11-2). Since (11-3) gives 


we get 



dl 

dJ ds 



- 

== xNr, 


di 

ds dt 

rfV 

dv 

dJ 

dv 

-— 

1 

i 

d- 

[ 

H 

1! 

= T- 

dt 

dt 

di 

di 


+ 


( 11 - 11 ) 


Upon differentiating (1L13) lienee the acceleration vector A = (N/dt lies 
111 the osculating plain, its iangt ntial component has magnitude equal to the 
linear accchration dv/dt, and its normal component ha^ magnitude kv^. 
This is a far-rea(‘hing generalization of the familiar results 


Ataufrontial — b, 


Aiiormal 

r 


for uniform motion in a circle of radius r. 


Taking the cross pruduet ot (11-13) and (11-11) with V rf*plaeed by R' we obtain 
R' X R" hv'^T X N - 


Hence, the direction of the binormiil is given bv R' X R" 

t*ven w hen the parameter is 

i rather than s Since B is a unit vector, v\e have 



R' X R" 

, d 


B == ,-“I 

IR' X R"| 

~d7 

(11-15) 

and similarly, the unit vector T is obtained from 




d 


" \B'\’ 

It 

(11-16) 

Knowing B and T we find N from 



N « B X T. 


(11-17) 

^See, for example, L. P. Eisenhart, “An Introduction 

to Differential Geometry,” 


sec. 6, pp. 25-27, Princeton University Press, l*rinceton, N.J., 1940. 

* In agreement with the results of Sec. 7, Eq. (11-13) expresses the fact that the veloc¬ 
ity is tangent to the orbit and has magnitude equal to the speed. 



314 


ALGEBR.1 AND GEOMETRY OF VECTORS. MATRICES ICHAP. 4 

These formulas eimbJe us to compute the trihedral when the curve is given with an 
arbitrary parameter t provided ds/dt > 0. 

Example 1. Find the equation of the osculating plane at < * 1 for the curve R « d 4- 

4 . fk. 

DiJfferentiation gives 

R'(l) « I 4 4j 4- 3k, R'^(l) * 4j 4* 6k 
Hence by (11-15) the binormai B is parallel to 

(i 4" 4j + 3k) X (4j 4- 6k) « 12i - 6j 4* 4k. 

The osculating plane is normal to B aiui contains the point 

R(l) - i 4- 2j 4 k. 

Hence its equation ivS G.t — 3</ 4 22 = 2, as the reader can verify. 

Example 2 A curve is a plane curve if, and only if, tin* tor.'jion is 'avto 
If the cuive is a plane curve, not a straight lmt‘, tluui the osculating Jilane is widl 
dehned and is the plane of the curve. Hence B is constant, and th(> toision \ani^h<‘s bv 
(1141). Suppose, conversely, that the torsion is zero Then B is constant by (11*5), 
and therefore, using (11-2), 

•^(B-R) = B-'^® = B-T = 0. 
as as 

This gives B*R * const, which us the e^juation of a plane. 

Example 3. Consider tlie cirisilar helix (Fig. 21) with eciuation 

R « ia cos 0 4 ja sin 0 4 0 , p po.sitive const. (11-IS) 

Here, the parametric equations are 


X « a cos fi, 
y — n sin 0 ^ 


z — pO. 

By (11-8) « ~ I'R^'i, wheie primes denote differentiation with respect to the arc param¬ 
eter , 9 . From (11-18) 

dR - —\a sin 0 dO 4 ja cos 0 dB 4 kp dB, 

so t-hat 

» cfRuiR = (a“ .sm’ 0 n"* co.s^ B 4 = (tt^ 4 p^) dO® 

and therefore d&/Js «« 1/V^a^ 4 ^ h, say. Tt follows that 

dR dR do 


ds dB ds 


(—ia sin 4 cos B 4 kp)/i,, 


d^ 

d«^ 

fR 

ds® 


d dR dB ^ , . . , 

» (— IQ cos e — )a sin B)h^, 

do ds ds 
dd'^RdO 

_—^ •— =e \^ia «iu 0 — itt COH d)h^, 

do ds^ ds 
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Fid 2\ 


On TTiakiuii; use of ionnnin (IFS) wo hnii 

= (R' ‘R") « (a^sin* -f a" t)}h^ « 


"<() thnt 

Ac'ooicling to (11-12) th(‘ toision is 


a 

*> t 

<r -f 


~ a sin 0 a vob 0 p 
- a «. (*s ^ ~ a sin 0 0 

a hin —avosO 0 


-P_ 
fF -f- 


If p = 0, WT got a circlo of radius n I>y inspection of (11-18) In this ease r = 0 because 
the curve is a plane curve and k ^ \/a because the radius is always equal to the constant 
a. The behavior as p -■* ^ may be discussial similarly. 


PROBLEMS 

1. Given the curve 7?(0 = ~ 1) -|- 2/j + + l}k* (ci) Find a unit tangent at 

t *= —1. (6) Find the equation of th(‘ normal plane at this point, (r) I'dnd the length 
of the curve from < « 0 to t 1. 

2. (a) Jf R(/) in Prob 1 represents an orbit, find the velocity and acceleration at time t. 
{(>) By use of (a) and Eq. (11-13), find the speed v at time t (c) By use of (a), (6), and 
(11-14) find the curvature x and the principal normal N at time (. 
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3. If the components of lt{0 are second-degree polynomials in then R « R(<) is a 
plane curve, (a) Prove this by use of (11-12) and Example 2. (h) Find the equation of 
the plane. 

4 . Show that (a) the tangents to a helix make a fixed angle with the axis of the helix, 
(b) the principal normal is perpendicular to the axis of the helix. 

fi, (a) Given a particle moving according to the law R(0 ** U -f find a unit tangent 
and a unit normal to the orbit at i « 1. (6) Find the cartesian components of V and A 
at < 1. (c) By use of the dot product and (a), find the tangential components Vt and 

A* of V and A at / « 1, (d) Find ds/dt as jR'(0 |, and from this find dh/dt^. Com¬ 
pare (r). 

In Prob. 5, (a) show that V„, the normal component of V, is zero, and find that of 
A at 1 by use of the dot product and Prob. 5a. (b) By (a) and A„ » | V find the 

curvature of the orbit at ( «* 1. (c) Show that the cartesian equation of the orbit is 

2 / «» z®, and compute the curvature by x « j/'V(l + Compare (6). (d) Explain 

how to find An in terms of A*, Ay, and A#, and use this to check some of your work. 


LINEAR VECTOR SPACES AND MATRICES 

12. Spaces of Higher Dimensions. There is nothing m>^sterious about 
the idea of spaces whose dimensionality is greater than three. In locating 
objects in the familiar three-dimensional space of our physical intuition, 
we have found it convenient to introduce a coordinate system and to 
specify the location of any point in the object by means of throe numbers 
termed the coordinates of the point. Thus, if a cartesian system of axes 
is introduced, w^e can associate wdth each point P an ordered tri{)le of 
labels (x^y^z). 

In dealing with the state of gas determined by the pressijre p, volume 
p, and temperature T, it is often useful to visualize the triples of values 
{PtV,T) as coordinates of points in three-dimensional space, but such a 
visualization fails when the number of variables characterizing the gas- 
state exceeds three. Thus, the state of gas may (and generally does) 
depend not only on the pressure, volume, and temperature, but also on 
the time t. Although a quadruple of values {p,v/r,i) cannot be represented 
as a point in a fixed coordinate system in the three-dimensional space, the 
geometric visualization is of much lesser importance than the analytic 
apparatus developed for coping with the geometric problems. This ap¬ 
paratus (analytic geometry and vector analysis) makes use of the tools 
of algebra and analysis which involve operations on ordered sets of quanti¬ 
ties such as (pfV,Td) or (xi,j 27 -•wliich are valid regardless of the 
number of variables appearing in the set. 

The habits of using the language associated writh geometric thinking 
are so strong, however, that it is natural to continue speaking figuratively 
of A quadruple of numbers (p,i;,T,0 as representing a point in four-di¬ 
mensional space and more generally refer to an ordered set of n values 
(xi^xsif ...,ar«) as a point in nnUmensional space. The values xij X 2 , **ffXn 
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may be of quite diverse sorts; the first three, for example, may be as¬ 
sociated with cartesian coordinates of some point M in three-dimensional 
physical space, X 4 may represent the magnitude of electric charge located 
at Af, Xs may stand for the time of observation, and so on. But whatever 
meaning we choose to attach to the individual values we can speak 
of the n-tuple ixi,X2, - • •fXn) as representing a point P in n-dimensional 
space. 

In three-dimensional space we found it useful to associate with every 

pair of points Pi and P 2 an entity P 1 P 2 which we called a vector a, and 
we have developed a set of rules for operations with vectors which form 
the basis for the algebra and calculus of vectors. 

Although in the initial formulation of these rules we have been guided 
by geometric considerations, we have distilled out geometry by giving a 
set of algebraic laws (2-1), (2-3), (2-4), (4-3), (4-4), and (4-5) which govern 
operations with vectors 

We can continue using the suggestive language of three-dimensional 
vector analysis and say that every pair of points Pi, P 2 in n-dimensional 
space determines a vector a. We further stipulate that in devising the 
rules for operating on such vectors w^e adopt the set of algebraic laws ( 2 - 1 ), 
(2-3), (2-4), (4-3), (4-4), (4-5), which contain no reference to the dimen¬ 
sionality of space, and wo define the vector 0 by the relation a 0 « 0 -f- 
a = a for ever>^ vector a. 

The dimensionality of space, we recall, entered only wiien we made use 
of these laws in those calculations w^hich involved the representations of 
vectors by cojnponents in special coorduiak systems. Thus in Sec. 3 we 
considered a vector in the plane determined by a pair of noncollinear 
vectors and introduced the notion of base vectors and the so-called com¬ 
ponents of the vector along the base vectors. We also saw^ that a vector 
in three-dimensional space can be represented uniquely in terms of its 
components in the directions of three noncoplanar base vectors. These 
remarks suggest that the dimensional!of space is in some w^ay connected 
wuth the number of base vectors needed to represent a given vector by 
components. In providing a generalization of the representation of vectors 
by components in spaces of higher dimensions, w^e need the notion of 
linear dependence of a set of vectors which we develop next. 

13. The Dimensionality of Space. Linear Vector Spaces. The concept 
of linear dependence of a set of vectors ai, a 2 , ..an is intimately con¬ 
nected with the idea of dimensionality of space. 

Definition. A set of n vectors ai, a2, .. an is linearly dependent if 
there exists a set of numbers cri, a 2 , .. i^t all of which are zero, such that 


aiSii -f a2^2 Ofnan 0. 


(13-1) 
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If no such numbers exists the vectors ai, a 2 , ..an are said to be linearly 
indepemtent} 

To get at the geometric meaning of this definition consider two vectors 

a and b whi(*h are like or oppositely 


directed (Fig. 25) Then we can find a 
number k 9 ^ 0 such that 


Fro. 25 


b - kA. (13-2) 


We can write this equation in symmetric form by setting k == —a//?, so 
that (13-2) reads 

era + ^b - 0. (13-3) 


Since neither a nor p is zero, it follows from our rlefinition of linear depend- 
en<*e that two collinear Ac^ctors are always lin(‘ariy dependent Inasmuch 
as every vector b direc1(*d along a can be represented in the form (13-2), 
formula (13-2) serves to define a oJLC-dnnctisfonal linear vector space. We 
observe that every two vectors in sucli a spa('e are linearly dependent. 

If we consider two noncollinoar vectors a and b (Fig. 2(>), then every 
vector c in their plane can be represented in the form 


c — Aqa -|“ A'' 2 b 


(13-^1) 



Fig 20 Fig. 27 


by a suitable choice of the constants Aq and ko. Equation (13-4) can be 
written as 

aa + i3b + 7C = 0, (13-5) 

in which not all constants a, p, 7 are zero. Formula (13-4) determines 
every vector c in the plane of a and b, and it thus defines a two-dimensional 
linear vector space, while formula (13-5) ensures that every three vectors 
in the two-dimensional space are linearly depeiulent. 

If we take three noneoplanar vectors a, b, c (Fig. 27), we can represent 
every vector d in the form 

d = Aia -f A 2 b -f A 3 C, (13-5) 

1 Of. the definition of linear ilependence of a set of functions in Sec. 21, Chap. 1, 
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from which follows the relation 

aa + /3b + 7 C + 5d = 0, (13-7) 

in which a, /3, 7 , 6 are not all zero. 

Equation (13-7) stales that in a three-^limrnsional linear vector space 
defined by (13-()), four vectors are invariably linearly dependent. 

The foregoing discussion indicates a relationshif) betv^’(‘en the dimen¬ 
sionality of a vector s])a<‘e vsilh th(‘ numlier of linearly independent vectors 
re(|uired to represent any vector in one-, two-, or three-dimensional \'ector 
space. 

We gcTKTalize this relationship by saying that in an r-dimensional 
linear vector sjiace (‘V(‘ry vector x can lie n'presc'iited in tin' form 

X = /CiRi + fCriSin, (13-8) 

\\h<'re ai, a 2 , a„ is any set of n linearly ind('})eiah‘nt vectors. It 
follows from (13-8) that in such a space every s(‘t ol more than n vectors 
is Inu'arly df'fKaidetil. 

We sliall call a gu’cn set of n lineally ind(‘p(‘n(lenl vectors lhf‘ base 
verfors (or the hasis} of the //-dinamsional linear vcM-tor spac(‘, and we 
shall t<Tm the nuinlx'rs {/ci,/r 2 ,.. .jAvd the mea'Oirr nnniiKrs associated with 
the basis ai, a 2 , . . •, a»i. 

In Sec. 3 we noted that every ve(‘torV in three-dimensional ^'(H‘tor space 
can b(' represented unicjuely by taking as a basis any set ot llire^e linearly 
ind(‘p(Tident V(M‘tors a, b, c. But we saw that a .spi'cial M‘t of mutually 
orthogonal unit vectors i, j, k wlien es(‘d as a basis great 1 \ simplifies the 
calculations. Idas sugg(‘sts the d(‘sirability ot n‘])res(»nting a vector x 
in the /i-dimensional s})ace in th(‘ torm (13-8) in which tii(‘ base \ectors 
a, are the analogues ot the unit vt‘ctor.s i, j, k 31ie construction of an 
analogous set of iiaso \ectors requires tlie extcmsion ot tlie concepts of 
hmgth and orthogonality to s(‘ts of vei‘tors in a-dinuaisional spaiT'. In 
making Ihi'se extensions we .suppose' that the scalar })rodu(‘t a^b of a and 
b is a real number, and that a-a > 0 unh'ss a - 0 Further the operation 
of .scalar multijilication obeys the' laws (1-3) and (1-3) 

We recall that in three-dimen.sional s})ace two veedors a and b are 
orthogonal if a*b — 0 

and a is a unit vector if 

a«a 1 . 

We (‘xlend the.st' detinition.s to vectors in ?i-diniensiomil space and show 

tiiat when any s('t of ri linearly independent vectors ai, ao, ..an is given, 
one can construct a new^ set of vectors ej, e 2 , ..such that 

e,*e; 0 , iff?^i, 

= 1 , if f *= j. 


(13-9) 
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A set of vectors satisfying the conditions (13-9) is called an orihonormal 
mt 

I^t the set of vectors ai, a 2 , ..an be linearly independent, so that 
the equation 

+ <^2^2 H-h Otn^n = 0 (13-10) 

can be satisfied only by choosing ai = ^2 =*•*=“ = 0. It follows 

from (13-10) that ai 0, for if it were a zero vector, the choice 

ai - 1, a2 = 0, , . Qfn == 0 

would satisfy (13-10) and hence the vectors a^ would be linearly dependent, 
thus contradicting our initial assumption. 

We shall write 

a,*at s |ax|^ 

and call |at| the length of a^ Now denote the product of ai by the recip¬ 
rocal of its length |ai i by ei, so that 


ai 



Since ei*ei = 1, ei is a unit vector. The vectors 

©1, a2, ..., an 

are obviously linearly independent. Consider next the vector 

©2 ~ a2 (a2*©i)©i- 

The scalar product © 2 *©i is 

€ 2 *©! = a2*©i (a2*©i)©i*©i = 0, 

since ©i is a unit vector. Thus ©2 is orthogonal to ©1 and the vector 


f 

©2 



is a unit vector orthogonal to ©]. 

The set of vectors 

Cl, ©2, aa, ..an 

is linearly independent, and we construct t)ie vector 

©3 = aa — (a 3 '©i)ei — (a3*e2)e2 

which is orthogonal to both ©j and © 2 . The vector 

/ 



i» a unit vector, and the set of vectors 
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S4> * • •» 

is a linearly independent set. We continue the process by forming 

64 « a 4 — (a4*ei)ei — (a 4 ‘e 2 )e 2 ~ (04*63)63 

which is orthogonal to Ci, 62 , and 63 , and normalize it by dividing it by 
1 64 1. The set of vectors 


®^n 

is linearly independent, and a continuation of the procedure yields after 
n st/cps the desired set of orthonormal vectors 

®2> * * - > ®n* 

14 . Cartesian Reference Frames. When the base vectors i, j, k of Sec. 3 
are oriented along the xyz axes, the coordinates of their terminal points are 

i: (1,0,0), 
j: (0,1,0), 
k: (0,0,1). 

By analogy we can say that when a set of orthonormal base vectors 61, 
62 , ..., en is oriented along “a cartesian reference frame in n-dimensional 
Fvu<didean space/' the terminal points of the base vectors have the co¬ 
ordinates 

61: 

62: (0,1,0, ..lO), 

63: (0,0,1,.. .,0), 

6n* (0,0,0,...,!). 

In this reference frame every vector x has the representation 

X = xi^i + :r2e2 H-h (14-1) 

where the j, are the components of x. 

On making use of the distributive la^v of scalar multiplication, we find 
that 

x*x = X? + 3-2 +-f- ( 14 - 2 ) 

since e.-e, = {,,, ( 14 - 3 ) 

where the symbol i,y, the Kronecker delta, means 

Sij - I, if t = j, 

« 0, if X j. 
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From (14-2) we conclude that the length |x| of the vector x is given f>y 
the formula 

1X I = + .r2 + * * + ^*n • 


This is tho formula of Pijthafjorafi in n-dnncnsional Euclidean f^pace. 

Also, if 

y ~ Ih^l ■+■ y'2^2 + • • * f l/n^nt (14-4) 

then on forming the scalar product x-y wc find 

X-y = Xiyi + X2I/2 d-h -CnlJn, (M-o^ 

wliich has the same structure as formula (f-9). 

For the sum of two vectors x, y uith conifxanmts 

x: 

y: (.Vi,/y2.*. ,2/nh 

we have the vector x + y with components 

X + y: (Xi + Pu ^2 + Z/i, . ♦ M + .Vn), (14-0) 

and for the product of x by a scalar a, 

ax: (rtXi,aX2,. . .,aJn). (14-7j 

If we have two vectors x and y in Euclidean three-dimensional space, we 
have a useful inequality 

(x-y)“ < (x-x)(yy) (14-8) 


which follows dire{*tly from the fxict that 


cos^ d 


(x*y)^ 

(x*x)(y'y) 


< 1. 


We show next that the formula (14-8), known as the Cauchy-Schwarz 
inequaUtyy is valid in an n-dimensionai luu'lidcan spa(‘e 
Indeed, 

(x-y)" (x-y)=^' 

x*x — 2-d— — 


(x-x)(yy) - (x-y)^ = y-y 


y.y 


y-y 


= |y|2 



xy 
y - 

y.y 


> 0, 


which proves the inequality (14-8). Wo note that the equality sign in 
(14-8) holds if, and only if, y = 0 or x = ay for some scalar a. 

The formula (14-8) enables us to establish the result 


ix-l-y|<lxH-|yi, 


(14-1)) 
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analogous to the ^^triangle inequality’' of Prob. 3 in Sec. 3. We compute 
|x + y|* = (x + y)'(x + y) = XX + yy + 2x-y 



< lxp+|y|=' + 2|x-y|. 

(14-10) 

But from (14-8) 

(x-x)(yy) > |x-yp, 


so tJuit. 

|x|-ly|>lx-y|. 

(14-11) 


The substitution from (14-11) in (14-10) yields 

!x + y|^ <lxp+|yp + 2 lx|.|y| = {|xl+|y|)^ 

and on extracting the square root we get the ineiiuality (14-9). 

In quantum mechanic.^ jhkI hi s(‘Vontl other branthes of })li,yhie.s it is necessary to 
coiisiUo! onicrt'ti si'ls of (Ofnplei nunilxTS Such sets can he viev\€*<l as 

components of a v(‘etor x m an n-dimensional complex vector spare. For the definition 
of addition of tnu complex vectors x, y uith components 

x: (.ri,T 2 ,.. .,T,*), 

y- (Vl,//‘>, P/n), 

we can take ftirmula (14-b) and define the multiplication by a scalar a (real or complex) 
liy (14-7). To mnk(‘ lh(‘ length ,xl of the* complex vector x rixd, "e aiiupt as th(* defini¬ 
tion of scalar proiluct of x and y the formula 


X • y * JjVi + hy2 + • • f fnt/n, (‘ 4 - 12 ) 

m which Ji denotes the conjugate of tlie comi>lex nuinbi'r x^ This foimula specialize.s 
to (14-5) when the eomponents of vectors are real, since for real numbers j, = ji. We 
note from (14-12) that 

yx =* Jp/I T i2?/2 H-h XnVny 


so that x-y y x, 

since the conjugate of the sum of complex numbers is equal to the sum of their conju¬ 
gates and the eonjugab* of the piotlind is the product of lh(‘ conjugates 
I-'ormula (11-12) yu'lds 

x-x = JciXi d- X2Jr2 H-k JnXn, (14-13) 

so that |x| =* y/x'X is a real number 

Th(> definition of linear independence of a set of complex vectors is that given in See. 
13 %vh<ue the constants txt are now in the field of comiilox numbers. 


PROBLEMS 

1. If one starts with the definition of a vector x as an n-tuple of n real or complex 
numbers (jri,J- 2 , • • - r^n) and uses for the definition of sum and product the formulas 

X -f y; (Ji 4* ?/j, ..., Jn + Vn), 
kx: (fcxi,. . 

n 

x*y S Ai/o 

txml 
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(x+y)-x - x-z+s-z, 

*.(y + Z) - x-y +*-*, 

(kx)-y - £(x-y), 
x-(*: 7 ) = k(x-y). 

S. Prove that if a^'\ a®, .... a^"^ is a set of n linearly independent vectors in a 
complex n-(limensionaI vector space, then the only vector x orthogonal to each of the 
vectors is the zero vector. 

8 . Prove that a set of mutually orthogonal vectors is always linearly independent. 

4. Modify the proof of orthogonalization in Sec 13 so that it applies to a set of linearly 
independent complex vectors. 

16 . Stimmation Convention. Cramer^s Rule. In dealing with expres¬ 
sions involving sums of quantities it is often useful to adopt tlie following 
summation convention: If in some expression a certain summation index 
occurs twice, we omit writing the summation symbol 2 and agree to sum the 
terms in the expression for all admissible values of the index. 

3 

Thus in a linear form X) summation index i appears twice under 

j«. 1 

the summation symbol 2 , and we shall write a,x, to mean + 02 X 2 
3 

4“ 03 ^ 3 . The symbol 23 + a 22 + CI 33 will be written simply 

as a„. Again, a double sum 
3 3 

53 23 + ai2XiX2 + 013 X 1 X 3 + 021 X 2 X 1 + 022 X 2 X 3 

%oml 

+ 023X2X3 + 031X3X1 + 032X3J2 + 0330*3X3, 

which has two repeated summation indices i and j under the summation 
symbols, will be written as 

o»yX,jj. 

The range of admissible values of the indices, of course, has to be specified. 
Thus, the expression 

1,2,3, 

ij = l, 2 , 3 , 4 

represents three linear forms 

OiiXi + O12X2 + O13X3 + ai4.r4, 

O21X1 + O22X2 + 023X3 + 034X4, 

031X1 + 032X2 + 033X3 + 034X4, 

corresponding to the three possible choices f » 1, 2, 3 of the free index i, 

^ The summation index j is often called the dummy index because it can be 
replaced by any other letter having the same range of summation. The 
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dummy index is anali^ous to the variable of integration in a definite 
integral, which can also be changed at will. Thus 

QfjXiXj = dkr^k^r) 

it being understood that the indices i, j, k, r range over the same sets of val¬ 
ues. Unless a statement to the contrary is made, we shall suppose that 
the indices have the range of values from 1 to n. We shall thus write 
formulas (14-4) and (14-5), for example, as 

y *= 


and (14-13) as 


x-y - x^yi, 


x*x = £^x^, 


We shall make use of this summation notation among other places in writing 
formulas for the product of determinants and for the expansion of deter¬ 
minants. 

We recall that a determinant 



an 

ai2' * 

'Clin 

a^j 1 — 

021 

022* • 

*02n 


(^nl 

0„2’ • 

* Clfin 


of order n represents an algebraic sum of n! terms formed from the ele¬ 
ments Uij in such a way that one, and only one, element from each row i 
and each column j appears in each term.‘ 

The product of two determinants | | and j htj |, each of the order n, 

can be written as a single determinant |r,;| of order 7i in which the ele¬ 
ment Ctj in the fth row and jth column is 

Cij ~ CLtkifjk ~ T* Cli2^j2 -f- * ' ' (15-2) 

Inasmuch as the value of the determinant |6tj| is unchanged when its 
rows and columns are interchanged, the value of the determinant 

|c,'j1 = |a,;| \bij\ 

with the elements (15-2) is the same as that of the determinant \cij\ with 
the elements 


Cij (^ikbkj — ^ibij "T -f- • • * -}- 0>inbnj’ (15-3) 

^ A discussion of determinants is contained in Appendix A. 

» Since the number »is fixed, the term Ot»6,« does not represent the sum of terms with 
respect to n. Here n knot a summation index. Of. Appendix A, Formula (1-10), 
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If the oof actor ^ of the element a^j in the determinant (15-1) is denoted 
by .4,-^, we can expand a,y in terms of (he cofactors of elements in any 
row or oc4umn of the doU'rminant. A reference to (1-5) in Aj^pendix A 
will show that the following formulas include the Laplace developments 
of (15-1): 


1 ik 
(1 11 A ki 


hjkdy 


(15- 


( 15 - 5 ) 


where hjk is the Kronecker delta and a stands for the value of |ro^|; for 
if in (15-4) k j, the expn^ssion represents IIh' sum of products 

of the elements in the jth (*oluinn by the cotactors of th(' (‘lements in th(' 
Hh column. The value of such a sum is zero, sinct* i( r(*pres('nts the 
expansion of a det(u*iniiiant with two hk(‘ columns. If j — /:, the sum 
UtjAik i‘'5 tlie sum of piodmUs ol (he (‘kumuUs in {h<‘jth column by the co- 
factors of tliosi' (‘lements, vit^dmg th(‘ value' a ~ \(iij ‘. Similar statements 
apply to (15-5) if we replace the wonl “column" by “row.” 

Formula (15-4) enables us to gixe a compar*! derivation of Cramer’s 
rule for solving a system ot n linear (Mpiations 


ciijXj = hi (] 5 -(>) 

in n unknowns .r^ 

We multiply both members of (lo-b) liy the (‘ofactors A,a. and sum with 
respect to i. We get 

n/ji-4~ A 

But by (15-4) this is 


bjkaxj = A^ih,, 


Tlie sum hj^Xj = and we cfinchule tliat 

Ajklh 

ta =- 

a 


(15-7) 


whenever a 0. The numerator in (15-7) is tlu* dotiTrninant obtained 
by replacing the elements in the Ath columii of liy the h,. 'Die rcudm- 
finding the foregoing calculations too concise will find a more I'xpansive 
discussion in Sec. 2 of Appendix A. 


PROBLEMS 

1 . Write out the following expressions in full. 

(a) Vu; (h) (r) a^bj/ = 5^*; (fl) (0 / dr,, (/) dr/; (f/) u,,; (h) 

0.1 A, 

ai -* (t) Qyjatk Sj/,; {j) (i,,r,Xj] (A) (!) a^r/, -= h, Tlie syinhots 5,, 

dXj 

denote the Kronecker deltas. 

* See Appendix A. We recall that tin* cofactor of atj is the signed minor M^j of the 
element the sign being ( —1)' 
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2. Write out the deU^rmiuanis represenU*(l by the expansion ttisAta and where 

A i; is the cofactor of the element Otj in | atj |. Also write out the determinants represented 
by <Jt 2 At 2 and an,At3. 

S. Expand the doterininauts: 

^^12 Ui3 ai4 
0 022 «23 ^24 

A A 

0 0 038 034 

0 0 0 044 

111 Oi 0 0 

(c) } I Jo ; (d) 02 ^>2 0 

x'i jrj J3' 03 63 Cl 

4. Multiply th(‘ di'liTininant (h) m Proh 3 by the determinants (r) and (d). 

16. Matrices. In this section we intro(hj(‘e the concept of a matrix and 
discuss some rules of ojieralion with matrices which are of value in the 
study of luK'ar transformations. 

An ni X f( Tuntrix an ordered set of mn (juantitu's a^j arranged in a 
re(‘tangular array of in rows and n columns. Jf 711 — n, the array is called 
a sqiKirc matrix of onhu’ n. 7'lre quantities a,, are called the elemtnU of 
the matrix. Tlius, a matrix Is an array 

On 0i2* ’ o/j,, 

0-1 (7 >2* 02 ,, 


/'>2 ' * ’ ^^nin 

v\h(Te [larentheses are used to emdo^e (h<‘ array of ehmients. We shall 
d(‘not(' iua(ric(‘s by capital lott(‘rs, or when it is desired to exliihit a typical 
(dcmimt of the matrix (Kbl), we shall write 

If the ord(u* of the elennmts m (!<'>-]) is changed, or if any element is 
clianged, a different matrix nssults For exanpile, a tnjile of values (a|, 
02 /i,ii r(‘pr(‘s(‘nting tht^ cartesian coordinates ot a point is a 1 X 3 matrix. 
Jf ui X 02 , the matrix ia^dhdi.i) obviously represents a difierent point. 

Two 711 X n matn(*es A ~ (o,^) and H - (b,j) are said to be equal if, 
and only if, Uij = />,; for eaidi ? and j. d'liat is, A ~ li only w’hen the 
elements in like positions of th(‘ two arrays ar(‘ eciual. 

We define the sum A + /> of two m X 11 matrices A = (o,;), B == (hij) 
to be tlie array 

A + B ^ Uh, + h,,), (l(b2) 

and their differeniT A — /i to be the array 

A — B - ia,, — b,j). 

We shall agree to say that the ]>roducl of the matrix A — (a,j) by a con- 
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giant A;, written kA^ is a matrix each of whose elements is multiplied by fc. 
Thus kA =« (katj). 

If we have an w X n matrix A and an n X p matrix we define the 
product of A and written AB, by the formula 

AB^ (a,(16-3) 

where, as agreed in Sec. 15, the repeated index j is summed from 1 to n. 
Thus, the product AB is an m X p matrix, and we can multiply two 
matrices only if the number of columns m the first factor is equal to the 
number of rows in the second. 


Example 1. If 


and 


AB ‘ 




^14-2 


and 
0-1 2 + 1 


y(l)(2) + (0)(0) 4- (2)(1) (1)(-1) -f 


V(0)(2) 4 (5)(0) + (6)(1) (0)(-l) + 




-1 ] 

B - 

r 

1 J 


\i 

-2 -] 

\ 


-1 3V 

Oh 


0 5] 



3 5/ 

I- (2)( - 

2) 

(1)(1) 4 

‘ (3)(- 

2) 

(2)(U 4 (■ 

{6)(- 

2) 

(0)(1) + 


!)■ 


(0U2) f-(2)(-l)v 


i5)(2) 4 (6)(~1)/ 



Also, if 


/I 0 2\ / 2\ /(l)(2)+ (0)(3) +(2)(-lK / 0\ 

AC»(2 -1 3)( 3)«((2)(2) + (-1)(3)4(3)(-1))«{-2). 

\0 5 o/V-l/ \(0)(2)+ (5)(3)+ (6)(-l)/ \ 9/ 

We observe that the rule (16-2) for the addition of matrices reciuires 
that A + B — B -jr A, but it does not follow from (16-3) that the order 
of factors in the product AB can be interchanged even when the matrices 


are square. Indeed, for 


A.r 

.„d ») 

\1 0/ 

\ 0 1/ 

the rule (16-3) gives 


/ 0 IN 

k /O -1\ 

AB = { 

while )• 

\-l 0> 

' \1 0/ 


Thus, the multiplication of matrices, in general, is not commutative. 



UOTAB VECTOR SPACBSS ANP MATRICES 


329 


SEC. 16 ] 

However, if we have two square matrices of order n which have zero 
elements everywhere except possibly on the main diagonal, then it follows 
from (16-3) that 




0 


0 


0 ••• 


h 

0 • 

• 

ja\hi 

0 

•• 

02 * • • 

0 


62 • ■ 


0 

^ 2^2 


0 ••• 

aj 

\o 

0 • 

.. J 

0 

0 • 

• • 0,nhj 


Such matrices are called diagonaL 

Thus for two diagonal matrices A and 

AB « BA, 


A diagonal matrix in which all elements along the main diagonal are equal 
is called a scalar matrix. A particular scalar matrix 


/I o - 

0 1 ••• 0 

\o 0 • • • l/ 


(16-4) 


is called the ideniiiy (or unit) matrix. 

We note that if I is the identity matrix and A is any square matrix, 
then ^ 

IA == A / =- A. (16-5) 

By analogy with the rules of ordinary algebra, we define the zero matrix 
0 to be the matrix such that 


O + A = A. 


It follows from (10-2) that all elements of the zero matrix are zeros. We 
obs(Tve that the product of two matrices may be a zero matrix even when 
neither of the factors is a zero matrix. Thus, if 


A = 


A 1 
0 0 
^0 1 


and 




then 


/O 0 Ov 
AB « ( 0 0 Oj- 

Vo 0 (y 


* More generally we can show that if AX » XA for every matrix A, then X is a scalar 
matrix. See Prob. 6. 
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If the matrix is square, it is possible to form from the elements of the 
matrix a determinant whose elements have the same arrangement as those 
of the matrix. This determinant is called the determinant of the matrix. 
From any matrix, other matrices can be obtained by striking out a number 
of rows and columns. Certain of these matrices will be square matrices, 
and the determinants of these matrices are called determinants of the 
matrix. For m m Xn matrix, there are square matrices of orders 1, 2, 
..., p, where p is equal to the smaller of the numbers m and n. 

Example 2. The 2X3 matrix 

4 m 

\Cf21 <*23/ 

contains the first-order square matrices (an), (ai 2 ), ( 023 ), etc., obtained by striking out 
any two columns and any one row. It also contains the scH'ond-ordcr square matrices 

( <*11 <ii2\ /an ai 3 \ /ai2 ai 3 \ 

021 <*22/ * <*23/ * V022 <*23/ * 

obtained by striking out any column of A. 

In many applications, it is useful to employ the notion of the rank of 
a matrix A. This is defined in terms of the dc'terminants of A. A matrix 
A is said to be of rank r if there ts at least one r-rowed determinant of A that 
is not zerOj whereas all determinants of A of order higher than r are zero or 
nonexistent} 


Example 3. If 


/I 0 1 3\ 

A s» I 2 10 —2 j, 

\-l -1 1 5 / 


the third-order determinants are 


1 0 1 
2 1 0 
~1 -1 1 

1 1 3 

2 0-2 

-1 1 5 


0 , 

0 , 



0 

1 

-I 



0 , 


t) 1 3 

1 0 -2 
-i 1 5 


0 . 


Since 


1 0 
2 1 


5 *^ 0, 


there is at least one second-order determinant difTf*rent from zero, whereas all third-order 
deit^rminants of A are zero. Therefore, the rank of A is 2. 

It should be observed that a matrix is said to have rank zero if all its 
elements are zero. 


* Cf. Appendix A, Sec, 2. 
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SEC. 16) 

If A =» (a»y) and B » (6,^) are two square matrices, then 

AB (aikhkj) 

and the determinant of the matrix AB is 

\AB\ == \atkbkj\- (16-6) 

We note with reference to (15-3) that the elements in the fth row and jth 
column of the df'lcrminant in (J6-()) are precisely those that appear in 
the prfKluct of two determinants [.d | = || and 17^| == \bij\. Thus 

\AB\=^\A\-\B\, (16-7) 

or in words, the determinant Ul/?| of the product of two matrices A and B 
7 S e(pial to the product of determinants |d | and |J?|. 

It follows from (16-7) that \\henever the product of two matrices is a 
zero iriatrix, thin the determinant of at least one of the factors is zero. 

A square matrix whose determinant is zero is (‘all(‘d a singular matrix . 


PROBLEMS 


1 . Make use of tlie definitions in Sec. 10 to establish the following theorems fot 
matrices: 


(a) A + B « B 4- A; (M (.1 -f B) -f C » A + (B + C); 

(c) (A + B)C - AC -b BC; {d) C{A + B) * CA -b CB. 


2. Verify that the matrices A and B in Example 1 of this section do not commute. 

3 . Multiply: 


(a) 


1 2 3 
3 1 2 
1 3 2 



(/>) 


1 2 3 

3 1 

1 3 2 


0 

0 

0 



4 . Show that (AB)C - AiBC). 

5. Determine the ranks of the matrices: 


I 2 3 

1 4 2 

2 6 5 


/I 0 1 
B - { 0 0 I 
U 1 1 




Is AB « BA? Is AB BA? Are these matrices singular? 

6. If AX « XA for every matrix A, show^ that X is a scalar matrix. Hint: I.^t 
X — (xtf), then since AX « A"A, OtjXjk - x^Uji, for all choir^es of a»^ and a^jt. Now choose 
«»/ where 5,(p) and ar*e tlie Kroneckei' deltas and p and q have fixed but 
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arbitral^ values ranging from 1 to n, and conclude that Sij «« «* 0 if i ^ j and 

x^i ^ Xjj for each i and j. 

17. Linear Transformations. The matrix notation introduced in the 
preceding section enables us to study effectively properties of linear trans¬ 
formations. 

A set of n linear relations 

Vt ~ j ^ 2, ..., Ti, (17-1) 

where the aij are constants, defines a linear transformation of the set of 
n variables x, into a new set yt. 

We can regard the quantities xi, x^, ..Xn as components (or measure 
numbers) of some vector x referred to a vset of base vectors ai, aa, ..an 
in the n-dimensional vector space. The quantities yi, 2/2, • *., 2 /n can be 
view^ed as components of another vector y referred to the same biisis. 
The relations (17-1) then represent a transformation of the vector x into 
another vector y. Since the lengths of x and y and their orientations 
relative to the base vectors at are different in general, wt can look upon 
the transformation (17-1) as representing a deformation of space. 

When the components of x and y are represented by the column ma¬ 
trices 


r\ 


I"] 

1 • 

Y =- 

I i/2 1 

\J 

' 

ij 


the set of relations (17-1) can be written in the form 

Y - AX, (17-2) 

where A == (a*^) is the matrix of the coefficients in the linear transforma¬ 
tion (17-1) and the product AX is computed by the rule (16-3). 

If A is a nonsingular matrix, we can solve Eqs. (17-1) for the x» by 
Cramer’s rule (15-7) and obtain the inverse transformation 


^ 1 



(17-3) 


where is the cofactor of the element in the determinant a « |a,j| 
of t)ie matrix A. 

The set of equations (17-3) can be written in matrix notation as 

X « 
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1 ^ 

1 

-421 

»-* 1 
1 


a 

a 

a \ 


' £22 

1 

( 

An2 

sss 

a 

a 

a 



A2n 

Ann j 

' a 

a 

a 


(17-4) 


It is natural to call the inverse matrix of A. We note that the 
inverse matrix can be constructed whenever A is nonsinguiar, that is, 
whenever the determinant | ^4 | s a does not vanish. 

If we form the product, of A and A'~^ 

(17-5) 

and recall ^ that 

atkAji( == 

we can write (17-5) as 

A A-' = (6„) = I, (17-6) 

where I is the identity matrix. 

Since the determinant of the product of two matrices is equal to the 
product of their determinants, we conclude from (17-()) that 


so that 


I 

Ul 


(17-7) 


Multiplying (17-6) on the left by *4“^ and on the right by (i4~^)~^ gives 

- 7 - AA-K (17-8) 

In addition to the inverse matrix A we shall make frequent use of the 
matrix 


(17-9) 



/«ii 

cr2i * 

• CLnl\ 

A' - 

fll2 

(^22 * 

’ Orn2 j 

1 

VOin 

®2n * 

' • CinJ 


* See (16-5), but note the relation of the subscripts on the to the rows and columns 
in (17-4), 
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obtained by interchanging the rows and columns in the matrix 


juii aj2 * * ‘ ^\h\ 
(l2\ 0-22 * * * ^l2n 

W/Il Rn2 * • ' f^nrJ 


(17-10) 


The matrix A' is called the transpose of A, 

On using the laws of addition and multiplication of matrices it is easy 
to show that 

(A + By - A' + 7T, 

(kAY - IA\ 

(Any - B'A'. (17-11) 

{Note order.) 

If ^\e recall tlie relation (17-8), 

A-Yi - AA-'\ 

and form the transpose 

(A~^i)' - (AA-^)', 

W6 get, on making use of (17-11), 

A'(A-~^)' - (d-')'A'. (17-12) 

Multiplying both members of (17-12) on tl)e left by (A')“‘\ we get 

{A'rKi'(A-~y == (d')~VA-“M'A'. 

Hence (A'^')' = (A')^'(A.1 - {A'r^. 

Thus {A~^y = (A'r\ (17-13) 

The important result embodied in (17-13) slates that the inverse of Ike 
iran^<fpose of the matrix A 'is apiat to tfa tiaosposi of its ini'( rsc. 

In many calculations it is necessary to com})ute the inverse of the 
product of two nonsingular matrices A and B. We can obtain the desired 
result as follows: Since 

{AUliAUr^ - /, 

or (see Prob. 4, Sec. 1(5) 

AB(AB)-^ = /, 

wo get, on multiplying both members of this relation on the left by A~^, 
A-^AB{AB)~^ = /I-' 
or BiAB)-'^ A-\ 

Multiplying this result on the left by B~'^, we get the desired result 

( Ajs )-‘ = b-^a-k 


(17-14) 
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(Note order.) This result can be extended in an obvious way to more than 
two matrices, so that, for example, 

(ABCr^ - 

Example 1. Compute A~^ for the matrix 


-D- 


All ~1, Ai2 * —3; A 21 =* A 22 « 1. 


Smcr a »» 1A 1 


Wi* note that \A M “ «• 1 /1/1 |. 

Example 2 If /i is a iionsinKulai matrix, siiow that the matric equatioas 

AX * / ami XA - I 

have' umtjiu' solutions A' = 

On multiplying l>oth lue'mhers of the givem equations by A~\ we get 

ami XAA~^ ^ lA 'l 

But A~h4 « « / ami « A~\ 


If we have two successive linear transformations 


Vi = 


(17-15) 


the direct transformation from the variables .r^ to the is obtained by 
inserting for the pj in the second set of Kqs. (17-15) from the first set. We 
thus get 

2 * = Vbfcn. (17-16) 

The transformation (17-16) is c.alled the product of the transformations in 
(17-15). If the variabl(‘s (^ 1 , 2 * 2 ,. .., r„), ( 1 / 1 AJ 2 . • • • dJn), and (2 i,^2,--.a 0 
are interpreted as comfionents oi the vectors x, y, z, represented by column 
matrices 
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we can write Eqa (17-16) as 


F«.4X, 

BY, 

and the product transformation (17-16) as 

Z « BAX, 


(17-17) 


(1748) 


Thm, when the variables Xi are subjected to a linear transformation (17-15) 
with a matrix A and the variables yi are subjected to a linear transformation 
with a matrix B, the product transformation has the matrix BA, Since BA 
in general is not equal to AB, the order in which the transformations are 
performed is material. 

When it is desired to interpret Eqs. (17-15) as transformations on the 
components of the vectors x, y, z, Eqs. (17-17) and (17-18) can be written 
in the forms 

y = Ax, 


2 = By, 


(17-19) 


z « BAx, 

where x, y, z are regarded as the column matrices X, Y, Z, respectively. 
The matrices in Eqs. (17-19) can be viewed as operators transforming a 
given vector into another vector. Since 

A{kx) ^ kAx, k const, 

and A{x + y) ^ Ax + Ay, 

one often speaks of 4 as a linear operator. 


PROBLEMS 



find A and Verify that {ABY « B'A' and (AB)-^ » 

2. Prove that {ABCy « C'B'A'. 

3. I^'ove that » A. 

4. Prove that if A is singular, there exists no matrix B such that AB « /. 

5 . If M xi cos a — X 2 sin a, yz « xi sin a + xz cos or, find A''^, A' and show that 
^ A*. If X is a vector with components (xi,z%), what is the geometric relation of 

X to y? Write out the inverse transformation x « 

6. If 2/1 a-i - X 2 , 1/2 *• ari -f x%, what is Is d.”-* « ^1'? If x is a vector with 

components (xi,:c 2 ), what is the geometric relation of x to y? 
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Vl 



1 


xa, 


compute the matrix A' for the inverse transformation and compare it with the given 
matrix A. If x is a vector with components {x\,x%,xz) and y is a vector with components 
iyuy^Vi), what is the geometric relation of x to y? 


8. Let 


und consider the vector 

-GD 

y a* Ax, 

Compute X — A^'y, Is it true that A' » A 

9. If 

Vl ** xi cos a 4* X 2 sin a, 


2/2 ■« —xi sin ot 4- X 2 cos a, 

and 

2 i » 2/1 cos ^ 4“ 1/2 sin /3, 


252 ** — 2/1 sin /3 4- ^2 cos 


find the product transformation directly and also by computing the product of the 
matrices as in (17-18). Compute BA and AB, and {BA)'^^. Also find {BAY 

and compare it with (BA)‘~^. 

10. If 

I/l ** 2X1 + ^2, 

1/2 « Xi - X2, 

and zi «■ Vl - 2 / 2 , 

22 a* 2yi -f 2 / 2 , 

perform the calculations required in Prob. 9. 

18 . Transformation of Base Vectors, In the preceding section we in¬ 
terpreted the set of linear relations 

Vi ~ ( 18 - 1 ) 

as transformations of components (a:i,X2,. x^) of a vector x into com¬ 
ponents (yi,t/2r* * ^yVn) of another vector y when the vectors are referred 
to the same basis (a,), so that 

X *= a;,a* and y = 2/,a,*. ( 18 - 2 ) 

If we introduce a new system of base vectors a^, obtained from the set 
a^ by a linear transformation 


a* « 


with 1 5 *, I 9^ 0, 


( 18 ^) 
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the vectors x and y in the new’ reference S 3 ^stein will have certain representa¬ 
tions , / 1 O 4 \ 

We raise two questions: (1) What is the relation of the components of 
vectors in the two representations (lS-2) anti (18-1) when the base \ e(*tors 
are transformed by (18-3j? (2) AVhat is the form of the transformation ol 
the components into i?, which correspoiKls to the deformation of the 
vector X characterized l\v lOqs. (18-1)? 

To answer the first question we insert from (18-3) in (18-1) and t^et 

X (18-5) 

while a reference to (18-2) shows that 

X = a,.r, “ aLj.Tj. (18-6) 

From (18-5) and (18-6) we coneludt' that 

This formula is tht" desiretl n‘lation‘^hip eonnt'ctim; tht* < of x 

when it is referrt'd to two (hff(»rent ba^(‘ ."^vslem-. i da ted by 118-3 

We note tluit in tlio transformation (l<S-3) the ^uiTirnat loii is on tlu^ 
second imlex i wtiile in (18-7) it is on l}i<‘ lir^t ind(‘x. In other words, the 
matrix of ccKdricients 6;, in (18-7) is the transpose of the matrix ib^j) in 
(18-3). 

If we w’rite the matrix in (18-3j as 

0>.j} - 

the set of equations (lS-7) e ui b(‘ wiitten as 

X =- (18-8) 

X and I beinfi; the column matrice'- with components .. .,and 

(fbf2) • • • 

On multipbdng (18-8) by on the left w(^ ^et the solution for f 

in the form 

Formulas (18-8) and (18-9) pvc a (‘omjdete answer to the first question 
The relationship eonne(*tin«; the compcnKails (//i,// 2 . ‘ . ,^n) ^'ith (?/i, 
i? 2 , • • ’jVn) can be represented similarly bv^ 

y === B'y\ and y] -= (B')~'^y, (IS-lO) 

We proceed next to the answer of the (iu(*sti<m eoueerninp; the form of 
the deformation of space (18-1) in tine new reference frame 01 *. 

We WTite Eqs. (18-1) in matrix form as 

y Ax, 


(18-11) 
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substitute for x from (18-8) and for y from (18-10), and obtain 

- .4^^. (18-12) 

To solve for ij we multiply on the left by and get 

ti = {B'r^AB'l (18-18) 

Thus the relationship between the components (fi,$ 2 ,-• ^in) and (??i,i? 2 > 
... ,r)n) is determined by the matrix 

,S ^ {B'r^AB'. (18-14) 

Since the matrix S characterizes the same deformation of space as the 
matrix A , the matrices A and S related in the manner of (18-11) are termed 
dmilar. To avoid carrying primes, we set B' — (\ and formula (18-14) 
then assumes the form 

S ^ (18-15) 

and (18-13) becomes 

n - SI (18-10) 


One of the important problems in the theory of linear transformations is 
to determine a r(‘ferenc(' frame in vhich the equations lor the deformation 
of spa(*o assume lorrns hub admit of simple mterpretat ions. For example, 
if it proves possible to find a matrix C such that the matrix S in (18-15) 
has the diagonal form 


- 


then F(|. (18-10) shows tliat 


/X. 

0 ••• 


0 

X2 ••• 




• \J 

Vi 

ll 


V2 

== ^2^27 


Vn 




(18-17) 


In three-dimensional space these correspond to simple elongatioas (or 
contractions) of the components of the vector in the directions of base 
vectors at determined by the matrix C ~ B' [see (18-3)]. 

Whetlier or not a matrix C reducing A to the diagonal form S can be 
found clearly depends on the nature of deformation specified by A. In 
many problems in dynamics and in the theory of elasticity, the deforma¬ 
tion matrix A will turn out to be symmetric, and we shall see in Sec. 20 
that such matrices can always be diagonalized by finding a suitable ma- 
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trix C. This fact turns out to be of cardinal importance because it enor¬ 
mously simplifies the analysis of many problems. 

In the following section we shall study properties of the matrix A in 
(18-1) for those transformations that leave the length of every vector x 
unchanged. In three dimensions such transformations represent rotations 
and reflections. 

19. Orthogonal Transfonnations. Let us refer our 7i-dimensional space 
to a set of orthonormal base vectors ei, 02 , .. ©w, introduced in Sec. 13, 

Relative to this basis the vector x has the representation 


X = e»x, 


and its length |xl can be computed from the formula 

|xp = XtX^. (IQ*!) 


Let us investigate the structure of the matrix A in the class of transforma¬ 
tions /1 A 

Vi = (LL2) 

which leave the length |x| of the vector unchanged. Now, the scjuare 
of the length of the vector y is 

lyp = (11K3) 

and since we suppose that |xt = |y|, 

« x,Xi. (19-4) 

We insert in (19-4) from (19-2) and get 

{aijXj){aikXk) = ar,x, 

or a^ja^kXjXk = (19-5) 

since 

On equating the coefficients of XjXk in (19-5), we get the set of restrictive 

conditions . 

*”■ ^)k (19-()) 


on the coefficients a,y if the transformation (19-2) is to leave the length of 
every vector unchanged. 

Equations (19-6), when written out for n = 3, are 

Gii + U 2 I 4* O 31 =« 1 , 

+ 022 + 032 »= 1, 

^13 "b O 23 + O 33 ** L 
“b O22O23 + 032O33 0 > 

+ 023^^21 + 033^*31 ** 

R 11 O 12 + ®2i023 -b 08ia32 « 0. 
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The determinant of the matrix in (19-6) is 

I ciijaik I = I fi/fc I “ 1, (19-7) 

and if we recall the rule for multiplication of determinants [cf. (15-2)], 
we conclude from (19-7) that 

I oo-a.it I = 1 o.,- 1 • I oo I “ “ 1. (19-8) 

where a is the determinant of (oo). Equation (19-8) states that 

ct =» zfcl. 

In three dimensions the situation when a = 1 corresponds to a rotation 
of space relative to a set of fixed zyz axes determined by the unit vectors 
i, j, k. The circumstance when a — 1 corresponds to a transformation 
of reflection (say, x = -x, y « —y, z = —s) or to a reflection followed by 
a rotation. 

A transformation (19-2) in which the coefficients satisfy (19-6) is 
(‘ailed an orthogonal transformaiion; it is called the transformation of rotation 
if |a»;| = 1, whatever be the dimensionality of space. 

If we denote by A' the transpose of {a^f) = A in (19-2), we can write 
the orthogonality condition (19-0) in matrix form as 

AVI « L (19-9) 

On multiplying this by A on the right we get 

A' = A~^ (19-10) 

Thus, in an orthogonal transformation the inverse matrix A*~^ is equal to 
the transpose A ' of A. 

When Eqs. (19-2) are wTitten in the form 

y « Ax, 

we can write their solutions for the x, as 

X A~V- (19-11) 

We conclude from (19-10) that the solutions of Eqs. (19-2), when the 
transformation is orthogonal, are 


X| « (19-12) 

In Sec, 17, we saw that the matrix of tlie product of two linear trans¬ 
formations is the product of the matrices of the component transformations. 
Using this fact and the property (19-10) it is easy to show that the product 
of two orthogonal transformations is an orthogonal transformation. 
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PROBLEMS 


1. Verify that the transformations 


(a) 

Vi 

a« Xi COS a — X 2 sin a, 


3/2 

« Xj sin a -}“ ^2 cos a, 

and 



(6) 


1 1 

2/1 



3/2 

*= X2, 



1 1 


.V3 



an* orthogonal. Do thoy roprest^nl rolalions? 

2. Discuss the traiivsformation 






V3 


2 2 1 


Find the inverse transformation. 

3. Prove that the product of any number of orthogonal transformations is an orthog¬ 
onal transformation. 

4 . If A i^ a symin(‘tric matrix (so that J' = A) and S is an orthogonal tranofoi inntion, 

prove that the matrix B == is symim*tric Thus, orthogonal trausforrnationvs do 

not destroy the symmetry of A. 

6 . 

( 1 I 0 
1 2-1 
0-1 a 

and let C be an orthogonal matrix 


C 


Til 

Cl2 

Cl3' 

^21 

C22 

C 23 

sC31 

C 32 

C33> 


Write out the set of equations which the c,; must satisfy if C'^^AC « B, where 4 .S is a di¬ 
agonal matrix 

6, Is the transformation 

3/1 =“ 3-^1 

yi * -2x1-4- xa, 

orthogonal? Find the inversci transformation. Determine the components of x: (xi,ja) 
and y: (yi,ya) when tlie base vectors ei, ea are rotated through 45 and 90®. 
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7. If yi «* OijXj is a linear transformation for the components of a complex vector 
x: (a;i,X 2 ,... jXn), which preserves the length |x| of the vector, show that dxjaik »» 
or A*A «* /, where A is the conjugate matrix formed by replacing every element a,y of 
A by Transformations such that A' «* A are called unitary; they are of great 
importance in quantum mechanics. 

20. The Diagonalization of Matrices. We saw in Sec. 18 that the 
determination of a nonsingular matrix C such that the given matrix A 
reduces to the diagonal form S by a similitude transformation C^^AC is 
equivalent to determining a set of base vectors relative to which the trans¬ 
formation 

Vi == a^jZj (20-1) 

assumes the form 

Vl = V2 = ^2^21 (20-2) 

We thus seek a solution of tlie matric equation 

C^^AC = S (20-3) 

in which A = (a,j) is a given matrix, C the unknown matrix 



'Cu 

(’12 *' 

’ ‘ Ciifc • 

' • Cin\ 

C21 

C22 • ■ 

' * • 

•' (’2n 

«'nl 

Cn 2 • 

" Cnk " 

' ‘ (^ntJ 


and S is the diagonal matrix, 


Ai 

0 ■ 

.. ox 


X 2 • 


lo 

0 • 

• X„/ 


(20-4) 


(20-5) 


On multiplying (20-3) on the left by C we get an equivalent matric equation 

AC = C*S, (20-6) 

provided that the solution of (20-0) yields a nonsingular matrix C. 

Now the matric equation (20-6) is equivalent to a system of linear equa¬ 
tions 

atjCjk - Cikkk, no sum on A:, A; == 1, ..n (20-7) 

obtained by equating the corresponding elements in the products AC and 
CS, 

For every fixed value of A:, the system (20-7) represents a set of n linear 
homogeneous equations for the unknowns (cik,C 2 k,* • .,Cn*) appearing in the 
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kih column of (20-4). The fact that the system (20-7) is homogeneous 
can be made plainer by rewriting it in tlie form 

{Uij - S^j\k)cjk = 0, no sum on k. (20-8) 

W(i recall' that a system of homogeneous equations has solutions other 
than tlie obvious solution cu — c^k — ' ‘ ~ Cnk = 0 if, and only if, its 
determinant ® 

la,,(20-9) 

On writing out this determinant in full, 

ail — X ai2 * • • a\n 
U21 <^22 — X • • * a 2 n 

- 0 , ( 20 - 10 ) 

Qnl ^n2 * ‘ ' ^nn ^ 

we see that (20-10) is an algebraic eciuation of degree n in X. Accordingly, 
there are w roots of this equation, say X ~ Xi, X == X 2 , ..X = Xn, and 
corresponding to each root X = X^ (/; = 1, .,a), the system (20-8; will 
have a solution 

(^ikf^2kj' ♦ • jC/jA:)* (20- 1 1) 

Ihe solution (20-11) yields the A-th column of the matrix C. If the roots 
\i, X 2 , ..., X„ are all distinct, one can pro\ {‘ that the matrix C will be non- 
singular.^ When the roots X^ are not distinct, it is impossible, in gruHTai, 
to reduce A by the similitude transformation (20-3) to tlie diagonal form, 
])e(‘ause the dcsire<l noiisingiilar matrix C may not exist. In important 
special cases, however (for example, wIkui A is a real and symraetne ma¬ 
trix), one can construct C such that S has the diagonal form even when 
some, or even all, roots are equal. 

A brief discussion of this is contaiiKHl in the following section. 

As a matter of terminology, Eq (20-9) is called the characteristic equation 
and its solutions are characteristic values of the matrix (a,,). The solutions 
(20-11) of the sy.st<'m (20-8) corresponding to these characteristic values 
are called characteristic vectors} 

^ Appendix A, 8<’c. 2. 

* Note Ihfii this (jcterminantal equation when i^ritten in matrix form is | A — X/1 « 0. 

* Or in Uie knguage of vectors, if we re<;ar<l each column of C as a vector 
{cikte 2 kf • - fCnjt), the vectors c^^Hk « 1, 2, ..., n) will be linearly independ(‘nt. A simple 
proof of tliis is given in I. S. BokolnikoflF, “Tensor Analysis,” pp. 33-34, John Wiley 
& Sons, Inc., New' York, 1951. 

<The hybrid terins mgemmluets for the X* and eigenvectors for the {c\k^c%k ,>. 

Are used by some writers who do not mind mixing (lerman with English. 
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Example 1. Reduce the matrix 
Ji » 

to the diagonal form S by the similitude transformation C’^^AC, 
The characteristic equation (20-9) here is 


( 20 - 12 ) 


Its solutions are Xi *» 0, X 2 » 2, The desired matrix C in our case has the form 



the columns in which satisfy the system of equations (20-8), yielding 
(oii ~ X;i)cufc + ai 2 r 2 A; * 0, no sum on ky 

021^1* -f (a22 — Xjt)riyk » 0, A: « 1, 2. 

Since an « 1, 012 *■ —1, an « —1, 022 « 1, we get^ on setting A: *« 1 and Xi 

Cl, - C 21 * 0, 

•~Qi + C 21 =a 0. 


(20-13) 


(20-14) 

0 , 

(20-15) 


As is always the case with nontrivial homogeneous systems of equations,* there are 
infinitely many solutions of the system (20-15). If we set cn « a (any eonslant), Ecjs. 
(20-15) give C 21 * a. 

Thus the vector (rn,r 2 i) appearing in the first column of (20-13) has the compo- 
iK'iit^ cij « C 21 a. Since any matrix C accomplishing the reduction will do, we can 
take * o == 1. 

The substitution of Aj =*= 2 and X 2 *■ 2 in (20-11) yields the system 

(1 - 2)ri2 - 022 * 0, 

—C12 -f (I — 2)022 « 0 , 


or 


— 0,2 — C 22 0. 


Again there are infinitely many solutions, and if we take 012 * a, then 022 “ —a. We 
can set a « 1 if we w’lsh, so that the elements of the second column m (20-13) are 0,2 *■ 1,, 
C 22 « — 1. The desired matrix T, thei efore, is 


C = 


c 



The inverse of C is easily found to be 

a» 

SO that C'^^A C is 

\ / 1 -i\ /I 
\yi -w ' v-i 1/ ■ \> 


('i M \ 
Wi -M/’ 



* See Appendix A, Sec. 2. 

• Usually one normalises solutions so that the length of the column vector is 1, 
This would correspond to the choice of o *» 1/V2. since ch -f c% «- 1. 
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Oii multiplying these matrices we get 



as we should, since 



(20-16) 


as we knew from the start [see (20-5)]. 

If we interpret 4 as a matrix operator characterizing the deformation of a vector x 
into a vector y (see (18*11)], the result (20-16) states that in a suitable reference frame 
the components of x and y are related by 

VI " 0 *^ 1 , V2 =*= 2 ^ 2 - 

We thus have a deformation of space corrasponding to the twofold elongation in the 
direction of one of the base vectors. In the notation of Sec. 18, C * B*, so that one can 
actually write out Eqs. (18-3) for the transformation of the base vectors. This, how¬ 
ever, is seldom required because the essential matter is to determine the deformation 
characterked by A rather than a reference frame giving a simple form of the deformation. 

Example 2. Determine the characteristic values of the matrix 

/ ^ 

(o.,)-^-l 1 -ij- (20-17) 

The characteristic equation this time is 


1 - X 


-1 

~1 




(1 - - 3(1 ~ X) - 2 * 0. 


We easily check that the solutions of this cubic are Xi = 2 , X 2 = 2 , X 3 » — 1 . Since we 
have a double root Xi «» X 2 — 2 , the solution of the sysb^m ( 20 - 8 ) will enable us to de¬ 
termine only two linearly independent columns of the matrix C. The matrix (20-17), 
however, is real and symn^iric, and one can, in fact, construct the third column of the 
matrix C such that 

C-^AC - S, 

( Xi 0 0 \ ^ 0 0 \ 

0 X 2 oJ«fo 2 oj* 

0 0 X 3 / \0 0 - 1 / 

However, the theory presented in this section does not explain how this can be accom¬ 
plished. 


1 . Diagonalise the matrix 


PROBLEMS 



and determine, in the manner of Example 1, the matrix C. Discuss the meaning of A 
when viewed as an operator characterizing a deformation of space. 
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2. Find the roots of the characteristic equation for the matrix 



8 . Find a matrix C reducing 

/-I 

A-l 0 - 
\ 0 

to the diagonal form by the transformation C 

4 . Diagonalize the matrix 

2 4 

4 2 

5. Prove that the roots of characteristic equations of all similar matrices are equal. 

Hmt: Write the charact-eristic equation of C'^^AC [cf. (20-9)] in the form — X/| 

» 0. But IC-^AC - X/| * - \T)C\ ^ \A - X/|, since |C“M « 1/|C|. 

21. Real Sjrmmetric Matrices and Quadratic Forms. Let the matrix 
A «= (a^J) in a linear transformation 

Vi = i, i 1, ,.n, (21-1) 

be real and symmetric, so that A' — A (or = ay»)- We shall indicate 
that in this case the matrix A can always be reduced by the transformation 
to the diagonal form S. Moreover, C can be chosen as an orthogonal 
matrix; that is, a matrix such that = C' [cf. F]q. (19-10)]. 

Linear transformations with real symmetric matrices dominate the 
study of deformations of elastic media. Real symmetric matrices also oc¬ 
cur in the st^udy of quadratic forms 

Q(xi,X 2 , ... ,x„) s a,jX»X;, f, j = 1,2, ..., n, (21-2) 

which arise in many problems concerned with vibrations of dynamical 
systems. 

We can always suppose that the coefficients in a quadratic form (21-2) 
are symmetric because every quadratic form Q can be symmetrized by 
writing it as 

Q = H(«.; + 

== bijX^Xj, 

in which the coefficients 

bij »= + a^i) 

are obviously symmetric. Henceforth we shall suppose that our quadratic 
forms have been symmetrized so that = ay*. 

It will follow from discussion in this section that the problems of reduc¬ 
tion of the transformation (21-1) mth symmetric coefficients to the form 

Vi ** ~ - ^ nn ** (21-3) 
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and of the quadratic form (21-2) to the form 

Q *= Xi{f + x^fl + • * * + Xnfn (21-4) 

are mathematically identical. 

We first note several properties of quadratic forms. If the variables Xx 
in (21-2) are subjected to a linear transformation 

Xx « (21-5) 

the form (21-2) becomes 

Q = axjiCxk^k){Cjr^r) 

= ai/»JkOrfA:ir. 

We denote the coefficients of ikir by hkr^ so that 

Q - bUkirx (21-0) 

where bj^r == cttjCarjr. (21-7) 

Since i and j are the summation indices and = Qjx, we >se(' that the valu(‘ 
of bkr is not changed by an interchange of k and r. Tlius, \s(‘ conclude 
that the symmetry of the coefficients in a quadratic form (21-2) is not 
destroyed when the variables x, are changed by a linear transformation 
( 21 - 6 ). 

If we write (21-7) in the form 

bkr ~ i 

we see that the sum UxjCjr is an element in the ith row and the rth column 
of the matrix 

AC 

or (ctxjCjr) ^ (c/,r). 

The product Cxkic^ijCjr) = Cxkdtr is the element in the kih row and the rth 
column of the matrix C'D, Thus we can write (21-7) as 

B - CAC, (21-8) 

The result (21-8) can be stated as a theorem. 

Theorem. When the variables Xi in a quadratic form (21-2) with a matrix 
A are subjected to a hncar transformation (21-5) with a matrix C, the resulting 
qmdratic form has the matrix C'AC. 

If the linear transformation (21-5) is orthogonal, then C' « and 
hence (21-8) can be written as 

^ B « C^^AC. (21.9) 

We conclude from (21-9) that the redvxiim of a symmetric matrix to the 
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diagonal farm by an orthogonal iran^ormaHon calls for a solution of ifie 
matric eguation 

S «= C-~UC. (2M0) 

This equation is identical with that considered in the preceding section. 
When the roots of the characteristic equation 

|«0 0 (21-11) 

are distinct and real, the method of Sec. 20 enables us to compute a matrix 
C which can be shown to be orthogonal. As a matter of fact, the desired 
matrix C can always be found whenever the matrix A is real and symmetric. 
Moreover, it can be shown that the roots of symmetric real matrices are 
invariably real.^ 

Tho fact that the columns of C7are linearly independent can be established easily when 
A is symmetric and the \k an; all unequal. Let c lx; a characteristic vector for X and 
c' be a charact/cristic vector for a different value, X'. Then, from (20-7), 

UtjCj » Xr,- and » XVj. 

Multiplying these equations by and r*, respectively, gives 

a^jc'-fCj Xcjc, and ~ XV,-C( 

after summing on i. Since a^j a;^, the left sides of thcvse equations are equal. Hence 
by subtraction, 

0 «« Xcjci — X'c,cJ « (X — XOcicJ. 

Since \ ^ X' we get c*c' «“ « 0, so that the vectors c and c' are orthogonal and thus 

liiKiarly independent. 

If the roots X, are all positive, Eq. (21-4) shows that the quadratic form 
(21-2) assumes positive values for all nonzero values of the variables 
Such quadratic forms are called 'positive definite,. They appear in numerous 
investigations in mathematical physics. 

An analogue of a symmetric (quadratic form (21-2) in which the variables x, are com- 
ph'x is a hUinear form * 

H a* (hjfaj (21-12) 

in which 0,7 «» Such forms are called Hermitian, and t.heir matrices ( 0 * 7 ) * A are 
Ifermttian matrices. Since a,y *» dy*, it follows that the elements on the main diagonal 
of A are necessarily real and that 

A'«I. 

From the structure of ( 21 - 12 ) it follow's that the Hermitian forms assume only real 
values for arbitrary complex values x,-, for on taking the conjugate of ( 21 - 12 ), we get 

B «» etxfCxfj « ajitjXi * //, 

which proves that H is real. 

^ For proofs utilizing the noUtion of this section, see I. S. Sokolnikoff, ‘Tensor Analy¬ 
sis" pp. 37-40, John Wiley & Sons, Inc., New York, 1951. 

* Cf. Prob. 7, Sec. 19, 
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Hermltian forms occur in quantum mechanics, and a discussion of the reduction of a 
quadratic form to a sum of stiuares (21-4) can be generalized to show that (21-12) can 
be reduced to the form 

H -f X2i2£2 4- • * * + Xftln^n 

by a linear transformation (21-5) with a unitary matrix C defined in Frob. 7 of Sec. 19. 

22. Solution of Systems of Linear Equations. In Sec. 15 we derived 
Cramer^s rule for solving the system of equations 

= 6 ,. ( 22 - 1 ) 

When the number of equations in ( 22 - 1 ) is large, Cramer's rule is inefficient, 
since it requires evaluating determinants of high orders. For this reason 
all practical methods of solving the system ( 22 - 1 ) depend on reducing it 
by some process to an equivalent system whose matrix is sufficiently 
simple to enable one to compute the unknowns without great effort. 

The system ( 22 - 1 ) can be written in matrix notation as 

Ax = b, (22-2) 

where A ~ (a,j), x is the column matrix (Ji,r2,. .. ,a"n), and b is the column 
matrix ( 61 , 62 )* • If ^ is nonsingular, the solution of ( 22 - 2 ) is 

X = (22-3) 

so that the determination of unknowns hinges on constructing the inverse 
matrix .*4 . The development of effective methods for inverting matrices is 

a major problem of numerical analysis. One of such methods depends on 
a reduction of the system ( 22 - 2 ) to an equivalent system 

Bx = c, (22-4) 

in which B has the triangular form 



612 

<>,3 • 

bln\ 

0 

1 

bza 

• • h2n 

'0 

0 

0 • 

■■ 1/ 


in which the elements below the main diagonal are all zero. When the 
system (22-4) is written out in full, it has the appearance of Eqs. (4-2) in 
Chap. 9 , whose solutions, as shown in Sec. 4, Chap. 9, can be obtained 
quite readily.^ 

Among other methods for solving the system (22-2) is the method of 
orthogonalization, the essence of which is as follows. Let us seek a matrix 
such that the product 

CA * J> 


‘ This is the so-called Gauss reduction method discussed in Chap. 9. 


(22-5) 
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is an orthogonal matrix. Since D is required to be orthogonal, it follows 
from (19-9) that 

DZ)' » D'D «= /, (22^6) 

where D' is the transpose of D. 

On multiplying (22-2) on the left by we get 

AX'CAx - A'C'Cb, (22-7) 

and since 

A'C' - {CAY = D' 

by virtue of (17-11) and (22-5), we can write (22-7) as 

D'Dx - D'Ch, 

However, by (22-6) D'D « /, so that we finally have 

X = D'Cb. (22-8) 

Formula (22-8) gives the solution of the system (22-1) once a matrix C is 
determined. We do not present the classical procedure for constructing 
C (known as the (Irara-Schmidt method) because of the rather special 
character of the problem. 
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This chapter is concerned with a study of scalar and vector functions 
defined in the familiar tfiree-dimensional space. It includes a discussion 
of curvilinear coordinate systems and a derivation of several transforma¬ 
tion theorems involving line, surface, and volume integrals. These theo¬ 
rems, usually associated with the names of Gauss, Green, and Stokes, are 
indispensable in the study of mechanics of fluids, thermodynamics, and 
electrodynamics and in virtually every branch of mechanics of deformable 
media. 


COORDINATES AND FUNCTIONS 

1. Curvilinear Coordinates. The chief advantage of formulating rela¬ 
tions among geometrical and physical quantities in the form of vector 
equations is that the relations so stated are valid in all coordinate systems. 
Only when orui comes to consider a special problem involving numerical 
computations does it prove desirable to translate vector equations into 
the language of special coordinate systems that seem best adapted to the 
problem at hand. For example, in analyzing vibrations of clamped rec¬ 
tangular membranes, it is usually advantageous to express the displace¬ 
ment vector in cartesian coordinates. In the study of heat flow in a sphere, 
the geometry of the situation suggests the use of spherical coordinates, 
while problems concerned with the flow of currents in cylindrical con¬ 
ductors may indicate the use of cylindrical or bipolar coordinates. All 
these coordinate systems are but special cases of the general curvilinear 
coordinatf^ syst^em which we proceed to describe. 

l^et us refer a given region R of space to a set of orthogonal cartesian 
axes yi, 2 / 3 . We denote the coordinates of any point P in JK by 
(Fig. 1 ) instead of the familiar labels {x,y,z). A set of functional relations 

^ xi{yx,y2yyz), 

^2 « ^2(2/1,2/2,ya), 

357 


(M) 



368 


VKCTOR FIELD THEORY 


[chap. 5 

comiectii]^ tine variables yi, y^, ya with three new variables xi, X 2 , xs is 
said to represent a tramformaiim of coordinates. We shall suppose that 
the functions Xiiyi^y^^yz) (t * 1, 2, 3) are single-valued and are continu¬ 
ously differentiable at all points of the region R and that Eqs. (1-1) can 
be solved for the y» to yield the inverse transformation 

Vi “ 

y2 « y2(^h^2,^z), ( 1 - 2 ) 

ys « ys{xi>^2^^3)f 

in which the functions y,(xi,X 2 ,X 3 ) are single-valued and continuously 
differentiable with respect to the variables x». The transformations (1-1) 



and (1-2) with these properties establish a one-to-one correspondence be¬ 
tween the triplets of values ivi.yz.yz) and (xi,X 2 ,X 3 ). We shall term the 
triplet of values (xi,X 2 ,X 3 ), corresponding to a given point P(yi,y 2 )?^ 3 ), 
the curvilinear coordinates of P, and shall say that Eqs. (1-1) define a 
curvilinear coordinate system Xj,X 2 ;X 3 . The reason for this terminology is 
the following: If we set in (1-1) Xi = Ci ( a constant), the equation 

xi{yuy2,yz) = ci (1-3) 

represents a certain surface Si. Similarly, equations 

X2{yhy2,ys) « C2, (1-4) 

and Xz{yi,y 2 ,yz) « C 3 , (1-5) 

represent surfaces S 2 and Sa- These surfaces, shown in Fig. 2, intersect 
at the point P whose cartesian coordinates (yi,y 2 ,y 3 ) can be obtained by 
solvmg Eqs. (l-3j to (1-5) for the y*-. 
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The surfaces Si are called coordinate mrfaces^ and their intersections 
pair by pair are coordinaie lines Xi, X 2 , xa. Thus, the Xj coordinate line 
is the line of intersection of the surfaces Xa « Ca and xa « cg. Along this 
line the only variable that changes is xx, since xg « Ca and xa « Ca along 
the line Xi. Similarly, along the xa coordinate line the only variable that 
changes is Xa, while along the X 3 line the only variable that changes is X 3 . 

A very special case of the set of Eqs. ( 1 - 1 ) is 

Xi « pi, 

^2 « 2 / 2 , ( 1 - 6 ) 

Xa = 2 / 3 . 

If we set Xt = Ci (i == 1 , 2, 3) in 
( 1 - 6 ), we get three planes yi = c, 
perpendicular to the y coordinate 
axes. These planes intersect at the 
point (ci,C 2 ,C 3 ). The coordinate sur¬ 
faces in this case are planes, and 
their intersections pair by pair are 
straight lines parallel to the coordi¬ 
nate axes. 

As a more interesting example 
consider a transformation 

2/1 *= r cos By 

2/2 = r sin By (1-7) 

2/3 = 2, 

which is of the form ( 1 - 2 ) if we set Xi = r, X 2 — By X 3 « z. The inverse 
of (1-7) is 

r = 4 . -f 2 / 2 , 

B «= tan""^ —» (1-8) 

l/i 

2 = 2/3, 

and it is single-valued if we take 0 < ^ < 27r and r > 0 . The surface 
r «= Cl is a circular cylinder y\ + yl- whose axis coincides with the 
2/3 axis (Fig. 3). The surface 0 « C 2 is the plane ya = (tan 02 ) 2/1 containing 
the 2/3 axis, while the surface 2 ; = C 3 is the plane 2/3 = C 3 perpendicular 
to the Va axis. The r, By and z coordinate lines are shown in Fig. 3, and 
we recognize that the curvilinear coordinate system r, By z is the familiar 
system of cylindrical coordinates. 




vucTOK nmt> rauoRT 
As a final sxampla, consider the transformatiozi 
Pi « p sin 0 cos 


with the inverse 


2/2 « p sin ^ sin 
2/3 = p cos 6, 


(<mp, 6 


(1-9) 



$ = tan 


<t> = tarr 


+ yi, 

, ■v^TH 


( 1 - 10 ) 


2/2 

2^1 


which is single-valued if we suppose 
thal p>0, 0<^<7r, 27r. 

I'he transformation defines a 
spherical system of coordinates. 
The coordinate surfaces p = const, 
6 = const, and <f> — const are, re¬ 
spectively, spheres, cones, and 
planes, shown in Fig. 4. The co¬ 
ordinate lines are the meridians, the lines of parallels, and the radial lines. 


PROBLEMS 

1 - OiacuBS the curvilinear coordinates determined by 

2/1 ar a-l -f X2 + ^3, 

2^2 * a:i - 0*2 -f J3, 

t/8 * 2xi 4- ^2 ~ X3. 

2 , Show by geometry that the coordinate lines in cylindrical and spherical coordinate 
systems intersect at right angles. 

2. Metric Coefficients. In this section wo introduce an abridged no¬ 
tation which will enable us to WTite many formulas compactly and without 
loss of clarity. Thus, wc shall write the set of three equations of trans¬ 
formation (1-1) in the form 

= a:»(2/t,2/2,2/3), t = 1, 2, 3, (2-1) 

aAd their inverse (1-2) as 




(2-2) 


OOOEBIKAtES AND FtJNCTIONS 
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Throughout this section we shall suppose that the Latin indices k 
have the range of values 1, 2, 3. 

If P{yuy 2 jy^) is any point referred to a set of cartesian axes y (Fig, 5), 
its position vector r can be written in the form 

t = i\yi + 12^2 + ial/a; (2»3) 



where the ii, 12 , 13 arc tlie unit base vectors, which in Chap. 4 we denoted 
by i, j, k. 

The square of the element of arc ds along some curve C has the form 
{dsf = {dy,f + {dy 2 ? + {dy^?, (2^) 

and since 

dr = ii dyi + 12 dy^ + 13 dy^y (2-5) 

we can WTite (2-4) as a scalar product 

3 

{d^y^ = X) dy^dy^ == dx-di. (2-6) 

laaal 

If we replace the j/* in (2-3) by tlieir values in terms of the xs with the 
aid of (2-2), r becomes a function of the variables and we can write 


dx 


dx 

— dxi + 
dxi 

— 

.-15a;. 


dr 

dX2 


dX2 + 


dx 

■~dX3 

8 x 3 


3iiXi,X3,X3) 


(2-7) 


Now, the symbol 
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denotes the derivative of r with respect to a particular variable Xi (i « 
1, 2, 3) when the remaining variables are held fast. Thus, if we fix the 
variables X 2 and x^ by setting X 2 *= C 2 and xg » cg, r becomes a function 
of xi alone, and hence the terminus of r is constrained to move along the 
Xi coordinate line in the x coordinate system determined by Eqs. (2-1)* 
Consequently, the vector 

dr Ar 

— = lim — 

dXi Axj 0 Axi 


is tangent to the coordinate line xi. Similarly, we conclude that the 
vectors dr/dx 2 and dr/dxg are tangent to the 0*2 and xg coordinate lines, 
respectively (Fig, 5). If we denote these vectors by a„ so that 

dT 

a, « —. (2-8) 

dXt 

we can write (2-7) as ^ 

dr = a, dxi (2-9) 

and hence Eq. (2-6) assumes the form 

(dsf = (r a. • (E a. (2-10) 


On expanding the scalar product in (2-10), we see that formula (2-10) 
can be written as 

3 3 

(ds)^ * X) 23 Rt'Rydx. dxy 
and hence, with defined by 


we can write it as 


3 3 


(ds)^ « 23 23 dx» dxy. 

»«i y-i 

In expanded form this reads 

(ds)^ =* giiidxi)^ + gi2 dxi dx2 + giz dxi dxg 
+ ^21 dX2 dxi + g22idX2f + ^23 dX2 dXg 
+ gzi dxg dxi + ^32 dx^ dX 2 + 


( 2 ^ 11 ) 

(2-12) 


(2-13) 


Since ai-ay = ayai, we see from the definition (2-11) that g^j = g^^. Thus 
the qn/jdratic differential form (2-13) is symmetric. 

For reasons which will appear presently, the coefficients fir»y in this 
quadratic form am called metric coefficients. We shall see that they can 
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be computed directly from Eqs. (2-2) without first calculating the vectors 
a,'. 

The vectors which were found to be tangent to the coordinate lines 
Xi at a given point P, are called hose 
vectors in the curvilinear coordinate 
system x. Any vector A with the 
origin at P can be resolved into com¬ 
ponents Ai, A2, A3 along the direc¬ 
tions of the vectors ai, a2, as (Fig. 

6). Thus, the base vectors a* play 
the same role in the system x as the 
base vectors ii, i2, U do in the car¬ 
tesian system y. It should be noted, 
however, that while the magnitudes 
and directions of cartesian base vec¬ 
tors are fixed, the vectors a^, in gener¬ 
al, vary from point to point in space. 

From the definition (2-11) we see 
on setting t = y « 1 that the length 
of ai is |ai| = V^. Similarly, |a2| ** \/^ and laal * \/^. These 
vectors are orthogonal if, and only if, 

9x2 - 921 - ara2 = 0, 

^31 = ^13 = Ei*E3 = 0, 

923 = ^32 = a2*a3 = 0. 

A curvilinear coordinate system for which these relations hold is called 
orthogonal, and we note that in an orthogonal system the quadratic form 
(2-13) has the structure 

(dsf = 9n(dxi)^ + g22(jdx2f + gssidxsf. (2-14) 

To get at the meaning of the coefficients gn, ^22, and ^33, we note that 
when an element of arc ds is directed along the xi coordinate line, dx2 
dxs ~ 0, since along the Xi line X2 and X3 do not vary. Thus, (2-14) gives 
in this case 

(dsi)^ ^ 9ii(dxi)^, 

so that dsi « (2-15) 

Thus, the length of the arc element dsi along the^i coordinate line is 
obtained by multiplying the differential of Xi by x^gn. Similarly we find 
that the differentials of arc ds, along the X2 and 0:3 coordinate lines are 

d«2 dsg « V^dxa. (2-16) 
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Since the (hi and the dxi are real, we conclude that gn> 0 f ^2 5 ! 0, 
gn ^ 0. In orthogonal cartesian coordinates (ds)^ is given by the formula 
( 2 - 4 ), and hence in such a system gn « ^^22 ® fi^33 *= 1 - 
An element of volume dr in general curvilinear coordinates is defined 
as the volume of the parallelepiped 

dr = |ai’a2 X as\dxidx2dx^ ( 2 - 17 ) 

constructed on the base vectors a,-. If the system is orthogonal, ( 2 - 17 ) 
reduces to 

dr = V'gnjf22jf33 dxi dxs dxs, (2-18) 

as is immediately obvious from ( 2 - 15 ) and ( 2 - 16 ). 

When a curvilinear coordinate system x is determined by equations of 
the form (2-1), we can write the inverse transformation (2-2) as 


Vk = yk(xuX 2 ,X 3 ) ( 2 - 19 ) 

and deduce the metric coefficients Qtj as follows: On differentiating Eqs. 
( 2 - 19 ) with respect to we get 


But in cartesian coordinates 


^Vk = S ““ dx^. 

OXi 


ds^ = X) dyk dyk, 


and the substitution from (2-20) in this formula yields ^ 

3 r 3 A'n, 3 


fs—*-.z— 

ilr, L.tlax, .Tidry 

t-wl }^l \k^l ^Xj/ 

On comparing (2-21) with (2-12), we see that 

^dVkSyk . . , „ „ 

t,J = 1,2,3. 

dXi dXj 


(2-20) 


( 2 - 21 ) 


( 2 - 22 ) 


This is the desired formula for the calculation of metric coefficients. 

To illustrate the use of (2-22) consider a coordinate system defined by 
Eqs. ( 1 - 7 ), which we write in the form 

^ Note that the Bummation index can be changed at will so that 
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Pi 

y% 

2/a 


Xi cos 
Xx sin X 2 , 

^3, 


to agree with tlie notation used in this section. From (2-22) we have 


Qn 




(sr 


\dxi/ 

= cos^ X2 4- sin^ *2 + 0=1, 

= Xi sin® *2 + S'? cos *1 + 0 = * 1 , 


933 ~ 


(?)+©+©' 

\dX3/ \dxs/ xdxz/ 


0 + 0+ I 


1 , 


dXi dX2 dXi dX2 dXi dx2 

~ COS X 2 (-Xi sin X 2 ) + sin a: 2 (a*i cos 2 * 2 ) + 0 ~ 0. 

We find in tlie same way that g 2 z = gis = 0. Hence the system under 
consideration is orthogonal. The expression for ds^ is 

3 3 

= S 2 9i} dxj 

»«=i 

« {dxxf + xl{dX2r + (d^3)^ 

which is a familiar formula for the square of the arc element in cylindrical 
coordinates if we recall that Xi = r, ^2 = = z. Since this s^ystem is 

orthogonal, the element of volume is given by (2-18), which in our case 

dr = r dr d6 dz. 

Example: Obtain expressions for the elements of arc and volume in the coordinate 
system x defined by 

»• Xi + X2 + 

^2 « api - X2 — Xi, (2-23) 

yz « 2xi ^x% — Xi, 

and discuss the system. 

On making use of formula (2-22) we find as in the preceding illustration that 
1^11 •* ffn ^ 1/12 *• 2, 9u ffiz “*“2. 
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<i»* •• 4 dxi dx^ — 4 dxi dxz + 2 dxt dxz + ^{dzi)^ 4* 3(da;a)*. 

The system is clearly not orthogonal, and to compute dr we shall make use of formula 
(247). Now 

r "»iiFi + feyj + lays 

■“ 4- ar» -f ^s) 4- h(zi — xj — xs) 4* *8(2x1 4“ a:* — xa) 

and hence the base vectors a« » dtidxi are 

ai « ii 4- ij 4- 213 , 

ft2 « ii — 4 4- 4, 


Thus, 

dr 


as « ii -* 4 — 4 . 


|ai*a2 R aaldxjdxadxs 


1 

1 

1 


1 

-1 

-1 


2 

1 

-1 


dxi dxa dxa 


4 dxi dxt dx%. 


On solving (2-23) for the x, we get 

Xi «. )4yi 4- Hj/2, 
acj *« ->iyi - %yi 4- 

xs * %vi 4- Mys ~ J^ya. 

The coordinate surfaces Xj — are planes, and the coordinate lines x< are therefore 
straight lines. 

The system in this example is a special case of an affine coordinate system 
determined by the transformation 


yi « a^iXi 4 - at 2^2 + 0*3^3, i ^ I, 2 , 3 , ( 2 - 24 ) 

in which the a,y are constants, AflBne transformations (2-24) occur in the 
study of elastic deformations, in dynamics of rigid bodies, and in many 
other branches of mathematical physics. 


PROBLEMS 

1. Discuss in the manner of the preceding example a coordinate system x determined by 

1 1 

»• “ ' V5**' 
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8 . Compute the metric ooeffideuts appropriate to a spherical coordinate system defined 
by Eqe, (1-9), and thus show that 

(d«)* « (dp)* -f p*(d5)* -f p* sin* d(<4)® 

and dr w p* sin B dp dB 

3 . If R — ix + jy + Ik is the position vector of a moving point P(x,y,s) in cartesian 
coordinates, show that the unii base vectors Or, e«, e* in cylindrical coordinates (r,d, 2 ) 
(see (1-7)1 are 

Or •“ i cos ^ + j sin »■ ~i sin ^ + j cos e» * k. 

Show that R «• rOr 4* *e„ compute dR/di and d^/dt^, and thus show that the velocity 
V and the acceleration a of the fjoint P are 



4. If R «■ ix 4- ly 4* k 2 is the position vector of P{x,y,z) in cartesian coordinates, 
show that the unit base vectors Op, e«, in spherical coordinates defined by Eqs. (1-9) are 

Cp « i sin 5 cos 4“ j sin ^ sin ^ -f k cos 

09 • i cos cos ^ 4“ J cos 5 sin <|!» — k sin 

wf — i sin ^ -f j cos 

6. If the position vector R of a moving point P in spherical coordinates is written as 
R » pOp, where ep is the unit vector in the direction of the increasing coordinate p, use 
the results of Prob. 4 to show that 

dR dp de . dtt> 

3, Scalar and Vector Fields. Gradient. If in some region of space a 
scalar u{P) is defined at every point, we say that u{P) is a scalar point 
function. An example of such a function is the temperature at any point 
in a solid. A function v(P) defining a vector at every point P of Uie 
given region is a vector point function. An example of vector point function 
is the velocity at any point P of a fluid. The regions of definition of 
scalar and vector point functions are sometimes called fields^ and one thus 
speaks of scalar and vector fields. Unless otherwise noted, we shall assume 
that w(P) and v(P) are single-valued functions. 

To facilitate calculations involving scalar and vector point functions, it 
is often convenient to refer the region of their definition to a special co¬ 
ordinate system x. If this is done, the coordinates of P can be denoted by 
(xi,X2,X3) and w(P) and v(P) can be denoted by u{xiyX 2 yXz) and v(xi,X2,X3), 
respectively. As explained in the preceding section, vixiyX^yXz) can then 
be represented in terms of its components yi{JEJi,X 2 ,X 3 ) (x * 1, 2, 3) along 
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the appropriate base vectors at (Ti,.t 2 ,T 3 ). It should be noted, however, 
that the introduction of coordinate systems is a matter of convenience 
and that u(P) and v(P) depend only on the choice of P in the field and 
not on any special reference frame selected to locate P, The fact that 
scalar and vector point functions are independent of coordinate systems 
is spoken of as invariance, and we sliall see that it is possil)le to associate 
with u{P) and v(P) certain new^ scalar and vector functions which have 
important invariant significance. 

We say that u(P) and v(P) are continuous at P if 

lim u{P') ^ u{P) and lim v(F') — v(P) 

p' p 

for every choice of P' in the neighborhood of P. Functions continuous 
at every point of the region are said to be continuous in the region. 



Let u(P) be a continuous scalar function in the given region. 

- > 

select a point 0 in this region for the origin of position vectors r ^ OP 
If P' is some point in the neighborhood of P, we denote OP' by r' and write 


(Fig. 7) 


r's: r + At. 


The difference quotient 


u(P') - uiP) u{P') - u{P) 

- — . —— _ __—- 

lAr| As 


(3-1) 


where |Ar| » A^, gives an approximate space rate of change of u{P), and 
w^e can study the limit of (3-1) as P' is made to approach P along the 
rectilinear path Ar. If this limit exists, we shall write 


u{P') ^ u{P) 

lim - 

0 As 


du 

ds 


(3-2) 
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md call it the directional derivative of u{P) in the direction specified by At. 
A different choice of F' yields a different vector Ar and in general a different 
value for du/de at P. 

A set of points for which u(P) has a constant value c determines a sur¬ 
face S called a level surface; we assume that at each point of S there is a 
uniquely determined tangent plane. Let us consider a pair of such sur¬ 
faces S and S' determined by u — c and w == c -f Ac, where Ac is a small 
change in c (Fig. 8). If P is a point on S and P' on S\ the change Au « 
u(P') — u{P) is Ac, and this is independent of the position of P' on S'. 
But the average space rate of change 


u(P') — u{P) Au 
I Art 


(3-3) 


clearly depends on the magnitude of Ar, The limit of this ratio as Ar is 
made to approach zero by making Ac 0 is the directional derivative 

(3-2) in the fixed direction determined by Ar. ILie greatest space rate 

— -> 

of change of u will occur when P' is taken on the normal PQ ^ An to the 
surface S (Fig. 8), since for this position of P' the denominator |Ar| in 
(3-3) is not greater than | An |, Indeed, 


An ^ Ar cos d, 


where 6 is the angle between the normal PQ to 5 and PP', 
On taking account of (3-4), we conclude that 


(3-4) 


du 1 du du 

--5-S - QQQ 0^ 

dn cos $ ds ds 


(3-5) 


The derivative du/dn in the direct!^’ .he normal to the level surface 
u = const is called the normal derivative oj u{P). 

If n is a unit vector at P, pointing in the direction for which Au > 0, 
we can construct a vector, called the gradient of u, namely, 

du 

grad u — (3-6) 

dn 


This vector represents in both the direction and magnitude the greatest 
space rate of increase of w(P), provided, of course, that du/dn ^ 0. The 
gradient vector (3-6) is clearly independent of the choice of coordinate 
systems and hence is an invariant. If we introduce the familiar cartesian 
coordinates xyz and denote u{P) by w(x,y,z), then, as in Chap. 3, Sec. 8, 

du du dx du dy du dz 
ds dx ds dy ds dz ds 


( 3 ^ 7 ) 
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whaf© dx/ds » C 06 dy/ds » cos (y,«), dz/ds =» coe («,«) are the direc¬ 
tion cosines of the unit vector s in the direction of the arc element ds 
(Fig. 9). In this case the position vector r or P is 


and 


ix + 32/ + kz 
dr 
ds 


. dx dy dz 

17- ++ 

ds ds ds 


(3-8) 



of s coincides with that of the normal 
conclude that 


We see that (3-7) can be written as 
the scalar product of the vector 

du du du 


Vu ^ i -hj-hk — 

dx By Bz 

and the unit vector s in (3-8). 


(3-9) 

Thus, 


~ « Vws. (3-10) 

ds 

Inasmuch as the greatest value of 
du/ds is assumed when the direction 
n to the level surface u * const, we 


Vu = grad u. 


(3-11) 


for the right-hand member of (3-10) can be interpreted as the component 
of the vector Vu in the direction s and the maximum component du/ds 
is obtained when s is directed along Vu. 

It follows from (3-9) and (3-11) that a formula for calculating grad u in 
cartesian coordinates is 

Bu du Bu 

grad u = i-f-j-hk — (3-12) 

Bx By Bz 


On comparing (3-6) and (3-12), we see that 

du 

Igradwl « — 
dn 


f/Bu\^ /Bu\^ TduV 

vU+U'+U 


Formula (3-9) suggests a definition of the differential vector operator 
V, called del or nabla, 

B B B 

Vsi—+ j —+ k-. (3-13) 

dx dy dz 


analogous to the scalar differential operator D introduced in Chap. 1, 
Sec. 23. The product of V and the scalar u(x,y,z) is interpreted to mean 
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(3-9). The reader will show that 

V(u + v) *= Vu + VVf 

V(uv) =s uVv + vVUf 

whenever u and v are scalar functions of (x,y,z), A formula for grad u 
in orthogonal curvilinear coordinates is deduced in Sec. 13. 

The directional derivative dv/da of a vector point function v(P) is 
defined by formula (3-2) in which u(P) is replaced by v(P). When v(P) 
is expressed in the form 

V « it;, + + kt;,, (3-15) 

where I, j, k are the base vectors in the system 


dv 

da 



dvy 

+ j-^ + k 
da 


dvg 

da 


(3-16) 


We have already employed a similar formula in Chap. 4, Sec. 7, to cal¬ 
culate the derivatives of the position vector R = ix + jy -f fcc with respect 
to the time parameter t. 


Example 1. Find the directional derivative of w « xj/** at (1,0,3) in the direction of 
the vector i — J 4- k. Compute the greatest rate of change of u and the direction of 
the maximum rate of increase of u. 

On substituting u ■■ xyz^ in (3-9), we find that the gradient u is given by 


At (1,0,3) 


Vu ■■ iyz^ -f -f- k2xyz. 
Vu « iO + j9 4* kO = 9j, 


Thus, the greatest rate of change | Vw | — 9, and the direction of the maximum rate of 
change is along the y axis. Since the unit vector 8 in the direction of the vector i — j 4* k 
18 


6 


1 


(i-J+k), 


we find on using (3-10) that the desired directional derivative is 
du 1 9 

Example 2. Find the unit normals to the surface x® — y* 4- 6 at (1,2,3). 

The surface in this example is a level surface for the function w ■* x* — y* 4* 
Since the gradient of u is normal to the level surface u *» const, we have by (Sr*!!) 

grad u •« Vu ■■ i2x — j2y 4- k2«* 


which at (1^2,3} has the value 


Vu - i2 - j4 4“ k6. 
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But thifl vector is directed along the unit normal n to u « x® — y* *4- »* *■ 6 in the direct 
tion of increasing w. Hence 


Vu 

n an —- 

|Vw| 


1 


(12--j4+h-6). 


The direction of the other unit normal vector is opposite to this. 


PROBLEMS 

1. Compute the directional derivative of u ** x® 4- 4* at (1,2,3) in the direction 

of the line 

z y z 

3 “ 4 s' 

Find the maximum rate of increase of u at (1,2,3); at (0,1,2). 

2, Find grad u if (a) w « (x® 4- 2/® 4- 2®)'"^, (S) u « log (x® 4" if® 4- «®). 

8. Find the directional derivative of u « x^y — yh — xyz at (1,”-1,0) in the direc¬ 
tion of the vector i — j 4* 2k. 

4 . Find the directional derivative of u » xyz at (1,2,3) in the direction from (1,2,3) 
to (1,-1,-3). 

5. Find the unit normal vector in the direction of the exterior normal to the surface 
ac® 4-2?/* 4- 2 ® “ 7 at (1,-1,2). 

6. Find the unit vectors normal to xyz =» 2 at (1,-1,—2). 

7. Show that Vr^ «» nr^~h, where r « ix 4* j?/ 4” kx and r «• ]r|. 

8. Use the result of Prob. 7 to compute the directional derivative of w « (x® 4- Jf® 4 2 ^®)'^ 
at ( — 1,1,2) in the direction of the vector i — 2j 4- h. 

9. Compute the directional derivative of 

V » i(x® - y®) 4- iixyz - 1) 4- fcs 
at (1,2,0) in the direction from (1,2,0) to (0,0,0). 

4. Integratioii of Vector Functions. Integrals of vector functions with 
the integrands consisting of scalar products of vectors are defined in the 
usual manner. Thus if v(P) is a continuous vector point function specified 
along a curve C joining a pair of points Po, and if r(P) is the position 
vector of P on C, then the integral 



is defined as the limit of a sum constructed as follows. Let (7, which we 
suppose to be sectionally smooth,' be divided into n arc elements As,* 
by inserting the points P,* (Fig. 10). We form the sum 

Zr(Pi)-iai, (4-2) 

* Hiw means that C consists of a finite number of segments with continuously chang- 
tug tangents. The tesin piecmise smooth is also used. 
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where Ar* « — r<, and compute the limit of this sum as n —> oo 

and eveiy (Art | —> 0. The continuity of v(P) and the smoothness of C 
suffice to show that the limit of (4~2) 
exists, and we define the line integral 
(4-1) to be this limit. 

If v(P) is defined in some region 
containing several paths joining Po 
and P, then the integral (4-1) will 
ordinarily have different values when 
computed along different patlis. In 
exceptional circumstances, discuased 
in the following sections, these values 
may turn out to be equal. 

If we introduce the xyz coordinate 
system and write 

v{P) = 'f{x,y,z) S Wx{x,y,z) + }Vy{x,y,z) + Vs)^{x,y,z), 
dt ^ i dx + } dy + k dz, 
the integral (4-1) becomes 

P' 

, Mx,y,z) dx + Vy(x,y,z) dy + v,{x,y,z) dz]. (4-3) 

0 

^Vhen the equations of C are given in parametric form 





X x{t), 

y = y{t), 

Z = 2 («),_ 


to <t <t‘ 


(4-4) 


where the values to, i' of the parameter t correspond to the end points 

rt' 

of C, the integral (4-3) can be expressed as a definite integral' / F{t) dt and 

JtQ 

evaluated by the usual means. 

Similarly, the surface integral 

f vender (4-5) 

JZ 


where n is a unit normal specified at all points of a suflSciently smooth 
surface * S, can be defined as the limit of the sum 

^ See the examples at the end of this section. The equivalence of the integral (4-3) 
and the ordinary Riemann integral, when (4-4) holds, is easily seen by comparing the 
sums of which these integrals are the respective limits. 

* We assume that the surface IS is two-sided and that n is directed toward one side. 
This normal we elect to call positim. If the surface is closed, it is customary to regard 
the exterior normal as positive. 
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lim 2:T(P<).n(P.) Atr*. 


In constructing this sum it is supposed that the surface S is divided into 
k elements of areas Act* and F» is chosen somewhere in the element Acri. 
The limit is then computed by increasing the number k of elements in 
such a way that the maximum diameter of every Acr, approaches aero. 

Formally one is tempted to extend these ‘‘limits of the sum" definitions to such 
ssrmbols as 

in which v{P) is a vector function and ds, da, and dr, respectively, are the elements of 
arc lengtli, surface, and volume. Thus, there is a temptation to define the volume 

integral j^{P) dr by the formula 

f * 

/ v(P) * - lim E (4-8) 

Jt k’-* « f.1 


in which it is imagined that the volume r is divided into elements of volume At,. A 

k 

definition such as (4-8) requires forming sums 53 Ar, of the hound vectors v(F,) 

which are determined at different points of the body. There is a question if the rules 
for addition of free vectors given in Chap. 4, Sec. 2, can be used to provide a sensible 
definition of (4-8). Without going into details we state it as a fact that the definition 
(4-8) makes sense in those geometries where the distance between a pair of points is 
given by the Pythagorean formula.* 

If v(P) is expressed in terms of its cartesian components as v » lvg(x,y,z) -f ivj,(x,y,z) + 
k*>,(x,v,«), the integrals in (4-7) can be reduced to the evaluation of three ordinary inte¬ 
grals by writing, for example, 

j v(P) dr ij Vadr -i- ij Vpdr -i-kj Vg dr. 


No such simple means of evaluating integrals of the type (4-7) are available in curvilinear 
coordinates because the base vectors in curvilinear coordinate systems vary from point 
to point in space. This remark may serve to explain why cartesian coordinates are so 
pronunent in calculations involving vectors, 
line integrals of the form 

f [P(x,y,«) dx + Q(x,y,z) dy + P(x,y,«) dz], (4-9) 

Jc 


which is identical with (4-3), are frequently defined without reference to vectors, but as 
we shall see, the definition adopted here has many interesting and immediate physical 
interpretations. 

* Spaces so metrised are called Euclidean, and it is only with such that we are oon- 
eerW in this book. 
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Mxampk 1. Evaluate the integral / t-dt when C ia the helical path 

Jc 


X ^ cos if 

1 / - sin I, (4-10) 


« -f, 

joining the points determined by < « 0 and t « r/2 and also when C is the straight line 
joining these points. 

Since r *» ific -f jj/ 4- kai, we get, on using (4^10), 

f -■ i cos < 4- j sin < -f to, 
dr «• (—i sin < 4- j cos t -f- k)dt. 
r ^ 

Hence / r*dr «* / idi ^ —• (4-13) 

JC Jc 8 

If the path C is a straight line joining the same points (1,0,0) and (0,l,ir/2), we can 
write its equation in vector form as 

r « n 4“ (rs - ri)f, (4-12) 

where ri and rj are the position vectors of (1,0,0) and (0,1 ,t/ 2), respectively (Fig. 11). 



The parameter t clearly varies between 0 and 1, since for i •» 0, (4-12) yields r ■■ n 
and for f •« 1, r » t%. But ri — i, rj « j 4- (ir/2)k, so that (4-12) reduces to 

r-i + (j+^k-l)«. 

Hence /c"’* t)“ T 

This is the same value as we got for the helical path. In the following section we shall 
•M why this particular integral is independent of the path. 



VECrOB EEEtS TBODOBT [CHAP. 5 

Example 2. (Sbtnpute tiie value of / v*<fr, where v «■ 4* ^^ the straight 

JC 

line joining (0,1) and (1,0). Discuss also when C is the arc of a circle centered at the 
origin. 

4 ( JiSidce r *« ir *f ji/, we have dr « i cb H- j dy and thereiore 

f v^dT ( (ydx+2xdy). (4-13) 

To evaluate this integral along the rectilinear path in Fig. 12, we write the equation of 
the path in the form 

y *• —x 4" 1 (4-14) 

and insert (4-14) in (4-13). Since dy » —dx, we get 

. j f v-dr • f [(—x 4* 1) da? — 2x dx] « / (1 — 3x) dx - —M- 

’ ’ f Jc Jo Jo 


7 ^he^iptegr%tionhere is performed so that the path C is traced from the point (0,1) to 
^1.0). 



To compute the value of the integr al (4-1 3) over the circular path C' joining the 
same two points, we note that y «• \/i — along C\ dy ^ —x dx/\/l — x* so that 



■fM 

■t 


1 — X* dx 


2x® 


'JLzifl 

\/i — a?^ 


dx 


vr 

rr 

4* 




Again, the path C" is traced out from (0,1) to (1,0). If the direction of description of 
C* is reversed, so tliat the circle is traced out from (1,0) to (0,1), the limits in the inte¬ 
gral must be interchanged and we get 4*v/4 for the value of the integral. 

Example Z, Evaluate / v*dr, where v « (iy — ix)/{x* 4* i/*) and C is the circular 

’if V i^rt V , 

path** + »* «« 1 described coimterclockwise. 
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This integnil c an be evftluated as in the preceding example hy substituting in the 
integrand y Vl — points on the upper half of the circle C? 
on the lower half. It is simpler, however, to write the equations upk^para* 

metric form ,,r, 


We thus get 


X »* cos S, ] 

[ 0 < S < 2ir, 

y «■ sin S, J 




(4.15| 

' Hi * ’ t ij U' (.''I 


r ■■ k + jy «■ i COS S + j sin S, 


dr » (—i sin ^ -h j cos ff) d0$ 


V 


i sin ^ — j cos 0 
sin* 0 4- cos* 0 


i sin 0 — J cos 0, 


n/J 

2 . ' » 


Hence 




sin* 0 — cos* 0) dB 


~2v. 


If the path is traced in the clockwise direction, we get +2r. ' '' ^ ^ 

It may prove instructive to evaluate this integral over the square C' formed by the 
lines X « ±1, y * =1=1 (Fig. 13). ‘ , ? 

Tlie int/Cgral over C' is equal to the sum of four integrals evaluated oVCr the paths 
PQ, QR, HS, SP, 

Now along PQ, y « —1, dy -« 0, r «» ir — j, dr « I dx, and v «. ( — i — jx)/(at* + 1). 


Hence 



tan“ 


1 

M 

-1 


T 

2* 


Along the path QR, x * 1, r « i + jy, dr « j dy, v «* (iy — j)/(l -f y*), so that 


[ v*dr «* [ 
JQR J -i 


1 +y* 


In a similar way we find that 




/ v*dr *« / v*dr * — 

Jrs Jsp 2 

so that the integral 

£,v.dr-4(-0»-2.. 

This time we obtained the same result as we did for the circular path^. 'lnBeci tb^weshoU 
see that this is not an accident and that the value of this inWgral for 
enclosing the origin is — 2ir. 


PROBLEMS 


1, Evaluate the integral in Example 2 over the path C consisting o^ straigbt-linp, 
segments joining the points (0,1), (0,0), (1,0) in that order. 

2. Evaluate the integral in Example 1 over the polygonal path joining the points 
(1,0,0), (1,1,0), (1,1,v/2) in that order. 
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9. Compute the value of the integral jjixy dx — y dp -\-dg) over the following paths; 

(a) Straight line joining (0,0,0) and (1,1,1), 

(h) Straight line joining (0,0,1) and (0,1,1), 

(c) Straight line joining (0,0,0) and (1,2,3). 

Note that this integral has the form v*dt. 

4u Compute the integral where v •« to — Ji/ -f ks over the helical path in 

Example 1. Also evaluate it over the rectilinear path. 

9. impute the work W done in displacing a particle of unit mass in a constant 
gravitational field F « —kor along the following paths: 

(o) Straight line joining (0,0,0) and (1,1,1), 

(f>) A polygonal path joining (0,0,0), (1,1,0), (1,1,1) in that order. HintiW fw*dt. 

Jq 

6. Line Integrals Independent of the Path. A special case of line integral 

[vxix,y,z) dx + Vy{x,y,z) dy + v,{x,y,z) dz], (5-1) 

in which v(x,|/,z) is known to be the gradient of some single-valued scalar 
uix^y^z) specified in the region R containing C, frequently appears in ap¬ 
plications. Now, if V s® VUf then 

du du du 

v*dr « Vu'dr = — dx-1- dy ^ - dz 

dz dy dz 

m du, ( 5 - 2 ) 

and thus the integrand in (5-1) is an exact differential. We can, therefore, 
write 

fc 

where Pq and P are the end points of the path C, 

This result is unique since u, by hypothesis, is single-valued. More¬ 
over, since it depends only on the end points Pq and P, we see that the 
value of the integral in (5-3) is independent of the path joining these points. 
If Cl and C 2 are two different paths shown in Fig. 14, then 

/ Vwdr « f Vu-dr, (5-4) 

/Po -fPo 

Cl Ct 


But along C2, 
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and we can therefore write (6-4) as 

fP fPt 

I Vu-dr + I Vu dx 0 

Jpt Jp 

Cl Ct 

or f^Vudr = 0, (6-6) 

where C is the closed path formed by Ci and C 2 . 



The results embodied in (5-3) and (5-5) can be stated as a theorem. 

Theorem I. The line integral fc ^^ dt, in which u is a single-valued 

contimwusly differentiable function in a given region R, is independent of the 
path^ and hence it vanishes for every closed path drawn in R. 

At first glance Theorem I appears to contradict the result in Example 3 

of Sec. 4, where the integral v*dr with v = (it/ — ix)/(z^ + y^) was 

considered. It is easy to check that v = —V tan~^ {y/^)f so that in this 
case u = — tan""^ iy/^)- This integral does not vanish when evaluated 
over any closed path including the origin because the function tan'~^ iy/x) 
is multiple-valued. Also, the continuity requirement of the theorem is 
not fulfilled by v » Vu at (0,0). 

We can also establisli another important theorem which is a converse 
of Theorem 1. 

Theorem II. If a vector point function v is continuous in a given region R, 

and if the integral v* dr is independent of the path^ then a single-valued 

scalar u exists such that v « Vw in R, 

We shall prove this theorem by actual construction of the function 
u{Xjy,z) fulfilling the conditions of this theorem. 
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By hypothesis, the integral V'dx when evaluated over any curve 

C joining Po(a^,yo,*o) with P(x,y,z) is independent of the path and thus 
defines a single-valued function 

f(3:.V,9) 

u{x,y^) =» / (pxdx + Vydy + v, dz). (5-6) 

We shall show that this function is, indeed, such that v = Vu, 

On replacing a; by x + Ax in (5-6), we get 

riz+Ax, V, 0 ) 

u{x +• Ax, y, 2 :) = / {vz dx + Vydy + v* dz) (5-7) 

(*0*1/0. «o) 

and on subtracting (5-6) from (5-7), we obtain 

r{x-¥Ax, y, z) 

u(x + Ax, y, z) — u{x^y,z) « / dx + Vydy + Vg dz), (5-8) 

J(x,y.z) 

The integral in (5-8) is independent of the path joining (x,y,z) with 
(x -f Ax, yy z)y and it suits our purposes to evaluate it over the rectilinear 
path y =* const, z «= const. Over such a path dy ^ dz - 0, and hence 
(5-8) 3 del<is 

rx-i-Ax 

u(x + Ax, y, z) - u{x,y,z) = / Vx{x,y,z) dx. (5-9) 

Jx 

But by the mean-value theorem for integrals 

tx-k-Ax 

Vx{x,y,z) dx = t>i({,y,«) Ax (5-10) 


where a: < f < a: -|- Ax. The substitution from (5-10) in (5-9), on dividing 
by Ax, gives 

m(x + Ax, y, z) - u(x,y,z) 

-;;-=■ i>x((,y,z). 

Ax 


Now, if we let Ax -♦ 0, we get 


du 

— “ »*(x,y,z) 
dx 

(5-11) 

by recalling the definition of partial derivative and by the fact that Vx 
is continuous. In a similar way we prove that 

du 

— = Vy{x,y,z) 
dy 

(5-12) 

du 

ai»l — “ o,{x,y/). 

dz 

(5-13) 
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But the statements to (5-13) are equivalent to the vector equation 
Vu « V, and the theorem is thus proved. 

It should be carefully noted that the key hypothesis which ensured the 
existence of a single-valued function u such that v « Vu is that the integral 

v*dr is independent of the path. The integrand v*dr « dx + Vy dy 
+ Vgdz may be an exact differential of a multivalued function u, in which 
case the integral ^ V'dr may depend on the path. 

A differential form 


v^{x,yyz) dx + Vy(x,y,z) dy + v„{x,y,z) dz, (5-14) 

in which y*, Vy, v, are continuously differentiable single-valued functions, 
is said to be exact if 

du du du 

Vx dx + Vy dy + v, dz ^ — dx -i - dy - dz, (5-15) 

dx dy dz 

where u is not necessarily single-valued. We can deduce a set of necessary 
conditions for (5-14) to be an exact differential as follows: If there exists a 
function u{Xjy,z) such that (5-15) is true, then on setting x = const, y == 
.const, z = const, in turn, we get 


(546) 


du du du 

as: 1 f) 5 SS —~f BB 

dx dy dz 

Differentiating the first of Eqs. (5-16) with respect to y and the second 
with respect to x, we get 

dVx d^u dVy d^u 

dy dx dy dx dy dx 

But the mixed partial derivatives in these expressions are equal, since 
dvxidy and dvy/dx are continuous by hypothesis (see Sec, 2, Chap. 3), 
Thus 


dVx dVy 
dy dx 

In a similar way we obtain two more relations 

dVy dvg dVg dVx 

dz dy dx dz 


(5-17) 


(6-18) 


The relations (5-17) and (5-18) give a necessary condition to be satisfied 
by the functions Vxj Vy^ Vg in (5-15) if that differential form is to be an exact 
differential of some function u{x^y^z). We shall see in Sec. 12 that these 
conditions suffice to ensure the existence of a function u such that (5-14) 
is equal to du. However, the conditions (5-17) and (5-18) do not guarantee 
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that u is single-valued. The question naturally arises: What supplementary 
conditions must be adjoined to Eqs. (5-17) and (5-18) to ensure that u 
IB sin^e-vaiued? A complete answer to this question is complex because 
it depends not only on the differentiability properties of «;x, Vy^ Vg but also 
on the geometry of the region in which these functions are defined. Tf 
the region of definition of these functions is simply connected and suf¬ 
ficiently regular to permit the use of certain integral transformation theo¬ 
rems discussed in Secs, 9 to 11, then u(x,y,z) determined from the formula 

u(x,y,z) « / (vx dx + Vydy + dz) (5-19) 

is single-valued. We describe these restrictions on the character of the 
region in the following section. 


PROBLEMS 

1. Show that the integral ^ r-dr is independent of the path and find its value when 

computed over the rectilinear path joining (0,0,0) and (1,1,1). Hint: r«dr » 

2. Show that {y — x^) dx 4* (i^ 4- V®) dy is an exact differential du, and,find u(T,y). 
8 . Show that the conditions (5-17) and (5-18) for an exact differential can be written 

in symmetric form as 

i j k 

1. 1, 

dx dy dz 

Vx Vy Vx 

4. (a) Is yzdx -i^zxdy +xydz an exact differential du? If so, find u(x,y, 2 ). (6) 
Evaluate the integral (yz dx -j- zxdy xy dz) over the rectilinear path joining 

(0,0,0) to a fixed point {x,y,z). 

X y f 

6 . If V «• i '-r-; 4 j -r, show that / v*dr « 0 for every closed path that 

X* 4* y* 4 Jc 

does not include the origin. What is the value of this integral over the circular path 


V X v 


4 1/* *• 1? Find u such that du « VU'dr. 

6. 

7. Find a function u such that 

X 


y ~ JL i X AiaVA Mr 0\4.VXft VAaOrV M'Mr — V ’ VIr* # 

B. Compute / Vwdr where u « log (x* 4 and C is the circle x* 4 “ !• 

Jc 


du 


: dx — 


z dy if X* > 2 /^ 


8, Ifv 
through (0,0,0). 


1 r 

V -, compute j 
r Jc 


ix.V,K) J 

V ~ • dr over some simple path that does not pass 
(« 0 -vo.*o) ^ 


TRANSFORMATION THEOREMS 


t. Simply Connected Regular Regions. The validity of several im¬ 
portant theorems op the transformation of surface and volume integrals 
presented in the following sections hinges on the regularity and connec- 
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tivity of domains oS definition of functions appearing in the integrals. 
A careful characterisation of such domains is extremely involved and is 
quite out of place in this book, but in order to aid the reader in imder- 
standing the circumstances under which the theorems in question are vaUd| 
we give a qualitative discussion. 

We shall say that a given region is connected if every two points of it can 
be joined by a smooth curve that lies entirely in the region. A region is 
simply connected provided that every simple closed curve ^ drawn in its 
interior can be shrunk to a point by continuous deformation without cross¬ 
ing the boundaries of the region. 

Thus, the interior of a square is simply connected, but the interior of a 
ring bounded by two concentric 
circles Ci and C 2 is not (Fig. 16) be¬ 
cause a closed curve C surrounding 
C 2 cannot be shrunk to a point with¬ 
out crossing C 2 . Also, the interior of 
a sphere is simply connected, and so 
is the interior of the region bounded 
by two concentric spheres, but the 
interior of a torus (an anchor ring) 
is not simply connected, A region 
that is not simply connected is called 
multiply connected. 

In dealing with bounded three- 
dimensional regions we shall say that 
the bounding surface iS is smooth if 
at each point P of the surface one 
can erect a normal n(P) which changes continuously as P moves along the 
surface. A surface that can be subdivided by smooth curves into a finite 
number of pieces each of which is smooth is called sectionally smooth or 
piecewise smooth. The surface of a cube is an example of a piecewise 
smooth surface. 

The surfaces which we shall consider have two sides, although not all 
surfaces are two-sided. A one-sided surface can be formed, for example, 
by gluing the ends of a long strip in such a way that the upper side of one 
end of the strip is joined onto the under side of the other end (Fig. 16). 

If two oppositely directed normals PN and PN' are drawn at any point P 

of the surface, then the normal PN when carried along the path PABCP 

will coincide with PN'. It may be noted that this surface has a simple 
closed curve as its boundary. 



‘ We recall that a simple closed curve ie a closed curve consisting of a finite number of 
nonintersecting smooth curves. 
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We shall suppose that all surfaces with which we deal are two-sided, 

piecewise aBooth, and such that for 
some orientation of cartesian axes the 
projections on the coordinate planes 
consist of the interiors of simple closed 
curves. Such surfaces we shall term 
regvlar. If a region is a union of 
finitely many regions each bounded by 
a regular surface, it will also be called 
regular. 

Regions bounded by a cone, a 
sphere, or a cube are regular simply 
connected regions. The interior of a 
torus is an example of a regular 
multiply connected region, 

7. Divergence. Let a continuously differentiable vector point function 
v(F) be defined in a regular simply connected region R bounded by a 
closed surface <r. The surface integral of the component of v in the direc¬ 
tion of the exterior unit normal n(P) to <t is called the flux of v over 
Thus, the flux F is 

f asrjfvnda. (7-1) 

When V is the velocity of an incompressible fluid, the scalar F represents 
the amount of fluid issuing from <r per unit time. The points of the region 
at which the fluid is generated are termed sources, and those where it is 
absorbed are sinks. When the total strength of the sources is greater than 
that of the sinks, the flux is positive; when the strength is less, the flux is 
negative. 

Consider, now, a volume element r containing within it a point P, and 
denote the bounding surface of t by v. Then the flux of v over <r per unit 
volume is 

j v*nd(r 

-!- ( 7 - 2 ) 

T 

If we let the volume r shrink to zero in such a way that the maximum 
diameter tends to zero, the quotient (7-2) will have a limit called the 
divergence of v at P. We denote the divergence of v by div v(P) so that 

j^v-nd^r 

div v(P) « lim -- 

T 0 r 

This quantity is a measure of the strength of the source at P. 



(7-3) 
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Inasmuch aa the volume r is arbitrary, the existencje and, indeed, the meaning of Un© 
limit in (7-S) are not quite obvious mathematically. One may let r approach zero while 
staying similar to itself, or one may let r become arbitrarily thin compared with its 
length, and so on. It is tolerably clear when suitable restrictions are imposed that 
aU these processes yield a unique limit L, independent of the shape of r. Moreover, 
the convergence is uniform in the following sense: Given any « > 0, there is a $ > 0 
such that 

^ V • n dcr 

-L < «, (7-3<*) 

T 

provided the maximum diameter of r is less than B. For rectangular solids r tliia fact 
is established in the next few paragraplis, though the proof in the general case is not 
presented here. 

To calculate div v in cartesian coordinates we consider k volume r in 
the shape of a rectangular parallele- 
pi{)ed with center at P{x^y^z) and 
with edges At, Ay, Az (Fig. 17). 

The flux of V over the surface of this 
parallelepiped is easily computed. 

Since v == + ]vy + la;*, the nor¬ 

mal component v*n of v over the 
face A BCD is Hence the outflow 
over that face is (?;x)a'+H ax Ay Az, 
where h mean value of 

Vx over A BCD. Similarly, the out¬ 
flow over the parallel face EFGH is 

i-Vx)x^H^x Ay Az, 

where the minus sign appears be¬ 
cause the exterior normal to EFGH is ~i and hence v*n == — t;,. 

Thus the net outflow over a pair of faces parallel to the yz plane is 

x-f 

[(t’x)xfHAx ~ (?^x)x-HAx] Ay Az - Vx Ay Az. 

x — H Ax 



Proceeding in the same way with the remaining faces we get for the total 
outflow 

x+J^Aaf z+HAz 

f v*ii do- = Ay Az + Vy Ax Az + v. Ax Ay. 

* X — HAx y —HAy at —HAi 



A* 
H A« 

Az 


div v(P) =« lim 




(7-4) 
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as Ax, Ay, and A 2 ; approach zero in any manner. Now, the three limits in 
(7*4) are the respective partial derivatives, so that we obtain the important 
formula 


div v(F) 


dVx 

dVj, dVg 

—+ 

— H- 

dx 

dy dz 


(7-5) 


The fact that the limits are partial derivatives is suggested by the definition of partial 
derivative (cf. Chap. 3, Sec. 2), Further discussion is required, however, because the 
functions »*, are mean values. By the theorem of the mean (see Eq. (3-7), Chap. 3j 

G(z + }4Ax, yi, zi) H Ax, yi, zi) ^ ^, 

---- 

Ax 


where 0% stands for dO/dx and where { is between x — Ax and x -f H If 
is continuous, then 

l< 3 i(«,n,*i) - Oi(.x,y,*)\ 

is as small as we please provided only that 

U -*l< As/, 1 * -*iI<Ma* 

wiih Ax, Ay, and Az sufficiently small. Hence the mean value 


Ay Az 


rX-i-H a* rV+H Av 

/ / Gi{^,yi,z\) dyi dzi 

Jz—H Ac Jy~H L 


i Aj/ 


is as close as we please to Gi(x,y,z), and the limit is therefore Gi{x,y,z). Applying this 
result to (7-4) with O ^ Vx gives dvz/dx for the first limit, and the others follow by 
symmetry. 

The analysis shows that we may let Ax, Ay, and Az approach zero in any manner. 
For instance, if Ax 0 first, then Ay 0, and finally Az 0, the volume becomes 
a plane, a line, and finally a point. On the other hand if we set 

Ax *» ah, Ay bh, Az =» ch 

where a, b, c are constant, and let h —* Q, then the volume stays similar to itself. Not 
only is the same limit obtained in all such cases, but the departure from that limit is 
seen to be uniformly small, provided only that 

niax||Ax|,|iij/|,|Azl 1 

is small. The remarks made in connection with (7-3o) are thus verified in this case. 


In terms of the differential operator 


d d d 

V = i —-f j —+ k~ 
dx dy dz 

introduced in Sec. 3, we can consider a symbolic scalar product 


V‘V 


/a d d\ 
(* 7" + j T" + r) ■ 

\ dx dy dz/ 

dVx dVy dVg 

dx dy dz 
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On comparing this with (7-5) we see that 

div V = V-v. (7-6) 

We can also define the Lapladan operator V® by the formula 



a® a® a® 


and observe that if v = Vu, then 


(7-7) 


div Vu = 


(7-8) 


Furthermore, if the symbol V x v is defined by the rule for computing 
vector products, we get 

i j k 

d s d 
V X V = — — — 
dx dy dz 

VZ Vy 



It is worth observing that the condition V x v = 0 requires that each 
component of the vector V x v be zero. We can therefore write Eqs. 
(5-17) and (5-18) (which ensure the existence of a scalar u such that 
V = Vu) in the compact form V x v = 0. 

In Sec. 13 we shall deduce a formula for div v analogous to (7-5) when 
the vector v is referred to an arbitrary orthogonal curvilinear coordinate 
system. It is important to note that the definition (7-3) is independent 
of the choice of coordinates, so that div v is an invariant. 

Example- If v « -f fcry/, compute div v at (1,2,3) and V X v at (x,y,t). 

Since Vx «« 3x^, Vy ** v, » xyz^, the substitution in (7-5) yields div v » 6x 4* 
lOxy 4* 3xi/z^. At the point (1,2,3) div v « 6 4 20 + 54 « 80. If v is interpreted as 
the velocity vector of fluid particles, we conclude that the point (1,2,3) is a source of 
the fluid. 

To compute V X v we use formula (7-9) and find 

V X V » i(zz® — 0) 4 j(0 — yz^) 4 k(6i/* — 0). 

Since this vector is not identically zero, we conclude that no scalar function u(a5,v,«) 
exists such that v * Vu, 
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PROBtBMS 

Pind div v if (a) v « k + -f fcr* (6) v « iix/r) + Kv/^) + where r «• 

Vx* 4-1^ -f {c) V « i(a - j/) -f j(:c - z) -f k(y - a:). 

%. Compute V®(l/r) and vV, where r » Vx® H- 

3. Shoi^f'that (a) div (u + v) « div u + div v, (6) div (uv) - V*(mv) » Vwv + wV* v, 
(c) (liv(iiK Y)''^ V*^u X v) «* v-(V Xu) — U‘(V X v). 
i, Show'tiiat div ^ a) « 0 if r « ix -f- j|/ -f and a is a constant vector. 

5. Find div (wv) if w » x* 4" I/* + aud v » ix 4- jy + hz- Also find div (Vw x v). 

8. The Divergence Theorem. An 

important relationship connecting 
the surface integral (7-1) for the flux 
of a vector field with the volume in¬ 
tegral of its divergence is deduced in 
this section. The resulting integral 
transformation theorem, known as 
ttie Gauss or divergence theorem^ is 
fundamental to all developments in 
mechanics of continuous media. 

Let a continuously differentiable 
vector function v(P) be defined in 
a regular simply connected region r 
bounder! by the surface cr. We sub¬ 
divide r into k volume cells Ar* in 
the shape qf rectangular boxes and parts of boxes (Fig. 18) and compute 
the divergence 

i . . 

divv(P,) = lim - (8-1) 

' ' Ar.-fO Ar^ 

for each celhAn. [The role of r and <t in (7-3) is now taken by Ar* and 
Afaj*!.] On )ra 9 aIJirig liie definition of limit, we can rewrite (8-1) in the form 

/ v*nd<r = (div v), Ar, 4-c.'Ar,', (8-2) 



the ^ '0 as At* 
forhi Ihe shm ’ ' ^ 

^ ^ I: f v-ndrr 


0 and where (div v)* s div v(P»). 

k k 

= 2 v), At< -f 52 «f At, 

»aa«i lawl 


We next 


(8-3) 


over all the cells i^nd observe that the surface integrals in (8-3) over the 
interfaces of adjacent cells vanish, since the exterior normals n to the 
eoUE^on laces of ithe boxes point in opposite directions. Thus the surviv¬ 
ing terms in the sum on the left in (8-3) correspond to surface elements 
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belonging to the exterior surface tr, and hence this mim is ecjtisd to / An 
The sum ^ (div v), Ar» approximates the volume integral / div v dr^ 

t«1 

and indeed, the approximation can be made as close as we wish by suitably 
decreasing the size (that is, the maximum diameter) of t)t^ 

k 

sum of terms involves products of small quantities and Ar^, 

»■« 1 

and it becomes arbitrarily small ^ in the course of the process described. 
We thus conclude from (8-3) that ^ » m • 


j v*nd<7 =* j div V dr. 


(8-4) 


The result embodied in this formula is the divergence theorev/i. This theoreth 
expresses certain surface integrals as volume integrals, and sihce it con¬ 
tains no reference to any special coordinate system, the result is true in all 
coordinate systems. In particular, if v and n are expressed in terms of 
their cartesian components 


n = i cos {x^n) + j cos (?/,n) + k cos ( 2 ,n), 
we can vrite (8-4), on recalling (7-5), as 

f r/dVt dVy dvz\ 

/ [Vx cos (j,n) -f Vy cos (?/,n) -f- cos (z,n)] da - [ -1-1- ]dr, 

'^r\dx by bzf 

(8-5) 

Verify the theorem (8-4) for v «> i(j:/r) -f \{ylr) + k(2/r), where r 
-h -j- 2^ and the region t is the sphere x* -j- j/“ -f < a". 

We readily find that 

QVx bVy ~ dt^g ^ ' 

dx ” Vy “ ^ 

60 that by (7-5) ^^2 _ ^2 2 

div V *-;— =a 

r r 


Now j div V dr ^ j - dr ^ ^ 

^ Of course, the number of cells k must increase without hmit as this process is carried 
out. 

• This is true by virtue of the uniformity emphasized earlier [see (7-3o)]. Thus, given 
€ > 0, we can make the subdivision so fine that t <i| < c for all the «,$ alt bnc^i* In th*tt 
case ^ ’ 

ISe* At» 1 < eS eF, ^ 


where V is the volume of the region. 


\ln. '* 


I 
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and it is easy to evaluate this integral in spherical coordinates, since in spherical coordi¬ 
nates dr •* ^ mn 6 d0d4> dr (see Prob. 2, Sec, 2), We have 

f /* 2 fa f*^^ 2 

/ div V dr I ^ dr S I I I - 1 ^ Bin B d0 d4> dr 4-xar, 

Jr hr Jo h Jo r 


On the other hand 




1 *d<r ** 4ra , 


since v*tt « 1, for n « i(x/r) + i(y/r) + k(z/r) is directed along the radius of the sphere. 
Example 2. Prove with the aid of the divergence theorem the relation 


jvudr ^ jun d<r, 


( 8 - 0 ) 


where u is a continuously differentiable scalar point function. 
Now in cartesian coo.dinates 

n BE i cos ix,n) 4* j cos (y,n) -f* k cos («,n) 

« i(n-i) -f j(n*j) + k(n-k) 


and 


du , du du 

Vu i -hj- f-k—, 

dx dy dz 


SO that (8-6) is equivalent to the three equations 

» J{ju)-nd<r, 

^likuhnd.. 


But these are the special cases of formula (8-5) applied to vectors v «■ iu, v *» ju, v « kw, 
and thus the correctness of (8-6) is established. 

Formula (8-6) can serve as a basis for a definition of Vw in the form 


Vu *« lim 
r—0 


j un da 
J<r 


(8-7) 


analogous to (7-3). 


PROBLEMS 

1. Prove that /r*n d<r « 3 t, where r is the position vector of a point on the surface 

Jff 

of a regular simply connected region of volume r. Hint: Apply the divergence theorem 
to the surface integral. 

% Compute f vender, where a is the surface of the cylinder a;® -f- y* =■ bounded 
J<r 

by the planes * «« 0, s « and where v » hr — jy 4- ke. 
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8. Find where r is the position vector of points on the surface of the ellipsoid 

(xVa*) + 4- {^/c^) - 1. 

4. Find the value of I v *11 d<r, where v » + jy 4- « x* 4 - y* 4- and 

or is the surface of the sphere x* 4- 4- Compute the integral directly and also 

with the aid of the divergence theorem. 

5. If V »• Vu and V^u » p, where p is a specified scalar point function, show that 

Hint: Recall that 

du 

dn 

C Use the divergence theorem to show that 

jTdiv (m Vv)dr «= ju Vv’ll da. 

Show that this equation can be written as 

dv 


JuV^vdr « Ju^da — Jvu‘Vv dr. 


This important relation is known as Green's first identity. 

7. Using Prob. 6 obtain the symmetrical form of Green’s identity, namely, 


j(uVh} — vV^u) dr ^ j 


which is also known as Green's second identity. (It is assumed in this identity that both 
u and V have continuous second derivatives.) Green’s identities are perhaps the most 
frequently encountered transformation formulas in mathematical physics. 

8 . If the twuce-diffcrentiable function* u satisfies Laplace’s equation « 0, what is 
f du 

the value of / dtr? Hint: Set v « 1 in Green’s second identity, Prob, 7. 

Jff dn 


9. Green’s Theorem. Line Integral in the Plane. Because of the im¬ 
portance in applications of line in¬ 
tegrals defined over plane curves, 
we deduce here a special form of 
the divergence theorem commonly 
called Greenes theorem in the plane. 

Let a vector function 

V = ivx(x,2/) + ivy{x,y) (9-1) 

with continuously differentiable 
components Vxt Vy be defined in the 
plane region R bounded by a simple Pici. 19 

closed curve C (Fig. 19). If we con¬ 
struct a right cylinder of height h with base R and apply formula (8-5) 
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to the region r bounded by this cylinder, we get 


I («?* cos (x,n) + cos ( 2 /,n)] dor 



(9-2) 


The exterior unit normals n to the top and bottom bases of the cylinder 



thus write (9-2) as 


are k and — k, respectively, and 
hence cos (.r,n) - cos (y,n) = 0 on 
the bases of the cylinder. The con¬ 
tribution to the surface integral in 
(9-2) from the bases is, therefore, 
zero, and the integral need be eval¬ 
uated only over the lateral surface. 
The element of surface da of tiie 
lateral surface in da ~ h ds^ where 
ds is the arc element of (7, and the 
volume element dr can V>e taken in 
the form dr — h dx dy. We can 


f f f / 

/ [Vx cos (x,n) -f- Vy cos (i/,n)]/i ds^ I-1- ]hdy dx. (9-3) 

JC by! 


where n is the exterior normal to C. But from Fig. 20 


cos (x,n) 




cos (y,n) 


dx 

d8 


(9-4) 


so that on dividing by A, Eq. (9-3) yields 


Ic = IL (S+5) 


(9-5) 


where in tracing C the region R remains on the left; that is, the path C 
is described in the positive direction. 

Formula (9-5) is Green's theorem in the plane. The function -Vy{x^y) 
is sometimes denoted by M(x,a/) and Vx{x,y) by N{x,y)y so that (9-5) 
assumes the form 

/c ^ ^ 

Our restrictions on Vx and Vy demand that M{x,y) and N{x^y) be continuous 
and have continuous partial derivatives in the plane region R, 

'We see that if dMjdy = bN/bx at all points of tlien dx + N dy) 
» 0 over every simple closed curve C drawn in jR. Conversely, if the 
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line integral in (Q-G) vanishes for every simple closed path C in R, then 


ff /bM dN\ 

for every region R, This enables us to prove that 

dM dN 
dy dx 

at every point of R; for suppose that 

dM dN 

- — - 5 *^ 0 


( 9 *^ 7 ) 


(9-8) 


(9-9) 


at some point P, and for definiteness let this difference be positive. Since 
(dM/dy) {dN/dx) is continuous, there is a small region P' including P 

throughout which the integrand in (9-7) is positive. But this means that 
the integral is also positive, and since (9-7) is known to yield zero for every 
region P, we have a contradiction. Thus, the hypothesis (9-9) is untenable. 
We summarize these results as a theorem. 

Theorem. A riecessary and mfficient condition for the line integral 
(M dx + N dy) to vanish for every simple closed path drawn in a simply 

amnected region R, where M, JV, dM/dy^ and dN/dx are continvoits, is that 
dM/dy = dN/dx at all points of R. 

The vanishing of the integral 


(M dx + N dy) (9-10) 

over every closed path is ecjuivalent to the statement that this integral 
is independent of the path, and it follows from Sec 5 that the expression 
M dx + N dy IS an exact or total differential dxi of a single-valued function 
uix^y) determined by the formula [cf. Eq. (5-G)] 

u{x,y) = r {M dx + N dy). (9-11) 

•'(xo-yo) 

We recognize condition (9-8) to be identical with (5-17). 

The theorem (9-6) can be extended to suitable plane multiply coimected 
domains in the following way. If 22 is a doubly connected region bounded 
externally by a contour Co internally by a contour Ci (Fig. 21), we 
introduce a ^‘cut'^ C joining some point Po on Co with Pj on Ci. The cut 
C can be visualized as a slit in the region P, forming the boundary C + Co 
+ Cl of the slit region. The slit region R is simply connected, and if we 
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apply formula (9-6) to it, we get 
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ff / dM dN\ 

-4 (Mdx + Ndy) + r' (Mdi + Ndy) 

J Co JpQ 

+ (£ (Mdx + Ndy) + [^'(M dx + Ndy), (9-12) 

J Cl JPi 



The arrows on the integrals in (9-12) refer to the direction of integration 

along Co and C\ as shown in Fig, 21, and the integrals / and / are 

evaluated along C in the direction indicated by the limits. Inasmuch as 
r^i r^o 

Jp ~ "~/pi ’ reduces to 


-//.(■ 


jpi 

dM 
R\dy 


dN\ 

dx) 


■)dy dx 

= {M dx + N dy) + {M dx + N dy). (9-13) 


An obvious extension of tliis result to the region R bounded externally 
by Co and internally by n contours C» (Fig. 22) yields 



dy dx 

<£ {M dx + N dy) + ^ <£ {M dx + N dy). 
J c a . ^ J Ci 


(9-14) 


An important result follows directly from formula (9-14) if it is supposed 
that continuously differentiable functions M and N are such that 

dM dN 
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in the region R. If (9-15) holds in J?, the double integral in (9-14) vanishes 
and we get 

<£ (Mdx + Ndy) ^ (Mdx + Ndy) 

J Co •T Ci 

- E(£ (Mdx-hNdy). 

Thus, the line integral over the exterior contour Co taken in the counter¬ 
clockwise direction is equal to the sura of the line integrals over the interior 
contours Ci taken in the sarae direction. In particular, if there is only 
one interior contour Ci (Fig. 21), we conclude that 

f (Mdx + N dy) ^ f (M dx + N dy). (9-16) 

JCo JCi 

This integral need not vanish. If, however, continuously differentiable 
functions M and N are also defined in the region interior to Ci and satisfy 
the condition (9-15) in that region, then the value of the integral 

dx + N dy) is zero, inasrauch as the integral on the left in (9-16) 
vanishes by theorem (9-6). 

PROBLEMS 


1. Show that the following integrals are independent of the path and find their values: 
/•(1.2) 

(a) J \{x^ 4- y^) dx + 2xy dy], 

(b) r dx + 77~~t 2 ^y] ’ 

Ao.o) L(1 H- x)* (1 4“ J 


/(0.{ 

{C) / 

•'( 0 . 0 ) 

riMM) r J 

id) / 

•'(0.0) L v 

r(2.3) 

(e) 


X 5^ —1, 


{y cos X da: 4- sin x dy), 
xy dx 


VT~— dy ^ > 


< J, 


. \/l — 

r''\x 4- 1) dx + (y 4- 1) dy. 

•'( 1 . 1 ) 

2. Write each of the integrals in Prob, 1 in the form j v»dr, and determine u(x,y) 


such that Vu ** v. 

3 . Find the value of 




—ydx 


+ 


X dy 




4- 4- J 

where C bounds the region interior to the circle x® 4* 2 /® ** 4 and exterior to the circle 


4- y* w 1. What is the value of the integral (a) over the circle x® 4“ y® 
the circle x® + y® *** 1? 

4. Compute the integral 


4? (&) Over 


If. 


div V dx dy, 


where v •• ix •+- jy over the region R bounded by the circles x* 4“ y® * 1 and x® + y* »■ 4, 
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6 . tJ»e fortnula (0-13) to evaluate the integral / i^ydx + x dy)^ where C is the path 

Jc 

bounding the region R in Prob. 4. What is the value of this integral over the path 
a** 4 - 3 /* -s 1 ? Over the path x® 4 - 3 /* *- 4? 


10. Curl of a Vector Field. We saw in Sec. 7 that with every contin¬ 
uously differentiable vector function v(P) one can associate a scalar 
div v(P) defined by the formula 

j n>vd(r 

divv(P) =« lim — - (10-1) 

r 0 T 


which has a simple physical meaning. 

We show next that v(P) can also be associated with a vector field called 
curl V, defined by an analogous formula 

j n X V d(r 

curlv(P) = lim - (10-2) 

r 0 r 

We shall see that curl v(P) bears an interesting relation to the concept 
of circulation in the vector field. 

Let v(P) be defined in some regular three-dimensional region P, and 
let C be a simple closed curve in R bounding a plane area ^4. At a given 
point P of A we construct a unit normal v so directed that v points in the 
direction of an advancing right-hand screw when C is traversed in the 
positive sense (Fig, 23). We then construct a right cylinder of small height 



t 

Fia. 23 


h iWth elements parallel to v and with base A and denote its surface by <f 
and its volume by r. 
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Since v is a constant vector, formula (10-2) yields 

j vn X yda 
vcurlv= lim - -(10-3) 

T 0 T 

But along the bases of the cylinder v is parallel to the normal n, and hence 
the triple scalar product vn x v vanishes over the bases. Accordingly, 
the integral in (10-3) need be computed only over the lateral surface of 
the cylinder. We can thus write 

j vn X vhda 

V • curl V = lim -» (10-4) 

T 0 r 

since da ^ h ds. 

But vn X V = V’V X n by Chap. 4, Eq. (0-4), and v x n = t along C, 
where t is the unit tangent vector to C, Thus the integrand in (10-4) can 
be written 

vn X vhd^ = V'V X nhds = v^ihds = hv^dr 

where dr is the differential of the position vector r of a point on C. If 
we further note that r == /i4, we can rewrite (10-4) as 

V • curl V = lim- (lU-5) 

^ 0 A 

The line integral is called the circulation of v along C. If v 

represents the velocity of a fluid, then v*dr = v-t d.s takes account of the 
tangential component of velocity v and a fluid particle moving with this 
velocity circulates along C. A particle moving with velocity v*n normal 
to r, on the other hazid, crosses C. That is, it flows either into or out of 
the region bounded by C. Hence formula (10-5) provides a measure of 
the circvlation per unit area at the point P. This formula can be used to 
compute the cartesian components of the vector curl v by taking v suc¬ 
cessively as the i, j, k base vectors and by evaluating the limit in the 
right-hand member. It is somewhat simpler, however, to get the formula 
for curl V in cartesian coordinates from the definition (10-2) with the aid 
of the divergence theorem.* 

Now the components of n x v in cartesian coordinates are n x vi, 
n X v*j, n X v*k. Consequently, 

n x V =« i(n X vi) + j(n x v*j) + k(n x vk) 

i(n*v X i) -f j(n-v x j) -f k(n*v x k). (10-6) 

* Also the uniformity of approach mentioned in connection with (7-3a) will then yield 
the ftame kind of uniformity for (10-2). 
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On inserting from (10-6) in (10^2) we get 

j (n*v X i) d(r j x j) do- 

curl V » i lim -h j lim -- 

r->0 r T-+0 T 


j (n-v X k) da 

+ k lim -(10-7) 

T -♦ 0 r 

But a comparison of the right-hand member of (10-7) with (10-1) enables 
us to rewrite (10-7) in the form 

curl V = i div (v x i) -f- j div (v x j) + k div (v x k). flO-8) 


On inserting 


V = i^x + ivy + ki;. 


in (10-8) we get 

curl V =* i div (jvz — kvy) -h j div (kt;* — ivg) + k div {ivy — jV^), 


and a simple calculation making use of formula (7-5) yields the desired 
result 


/dv^ dVy\ , /dVr dVz\ /dVy dVx\ 

‘ w - fc) +' (ill - 


If we recall the expression (3-13) for the symbolic vector V, we can write 
(10-9) compactly as 

i j k 


curl V — 


d d d 
dx dy dz 

Vx Vy ^ Vj, 


s V X V. 


(10-10) 


An analogous formula for curl v in orthogonal curvilinear coordinates 
is given in Sec. 13, and several useful relations involving the use of the 
curl operator are recorded in Prob. 1. 


Example: Compute curl v if v « ixyt + ixyz^ + \u^yz. 

The substitution of v* «« xyz^ Vy * xyz^, v, x^yz in (10-9) yields 


curl V -« i{x?z — 2xyz) + Hxy — Zx’^yz) -f kiyjp’ — xz). 
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PROBLEMS 


1. Show that under suitable hypothesis on continuity of the derivatives: 

(a) curl (A 4* B) » curl A -f curl B; 

(b) div curl A«V*VXA««0; 

(c) curl curl A » V div A — V*A, where V*A ^ iV®Ax -h + kvM*; 

(d) curl Vu » V X (Vu) « 0; 

(e) curl (uA) V X (uA) * wV X A -f Vw X A; 

(/) div (A X B) «« B’curl A— A-curlB; 

(g) curl (A X B) *= AV*B - BV-A 4- (B*V)A - (A-V)B, where (A-V)B as C is 
the vector with components 


<)Sx dSx dBx 

dx dy dz 



4. Let a rigid body rotate with constant angular velocity fl about some axis through 
a point 0 m the body. If r is the position vector of a point P(x,t/,z) relative to a set of 
axes fixed at 0, the velocity v of P is v «» vo 4- fl x r, where vo is the velocity of 0 rela¬ 
tive to some reference frame fixed in space (cf. Sec. 8, Chap. 4). Show that curl v ~ 2ft, 
80 that the angular velocity ft at any instant of time is equal to one-half the curl of the 
velocity field. Note that the velocity vo is independent of the coordinates (x,y,z) of 
points in the body. 

6. Show^ from geometrical considerations that the angle de subtended at the origin by 
an element ds of a curve is dO « (n*r/r^) ds. 

6, A solid angle w subtended by a surface <r is measured by the area subtended by the 
angle on a unit sphere 8 with center at the vertex of w. Show that 


f i j 

03 =«* — / n * V - d(r. 

J<r r 


where r is the position vector of points on a measured from the vertex of co and n is the 
unit normal to a. Hint * Apj>Iy the divergence theorem to a volume formed by the bundle 
of rays issuing from the solid angle and by the areas cut out by these rays on S and on <r. 
7. Referring to Prob. 6, show from geometrical considerations that 


do3 «** —r- da, 

r 
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11* Stokes’s Theorem, This useful integral transformation theorem 
enables one to reduce the evaluation of oertam surface integrals to the 



calculation of line integrals. 

I^et be a three^limensioual re¬ 
gion in which v(P) is a continuously 
differentiable vector function and <7 
a regular open surface embedded 
in R, We suppose that the edge 
of O' is a simple closed curve C 
(Fig. 24). Then it is true that 

j n * curlvdcr =^v*(ir, (11-1) 


where n is a unit normal to a and 
the line integral over C is evaluated in the direction determined by the 
chosen positive orientation of n. 

To establish formula (11-1), which is knowm as Stokes s theorem^ we 
follow a procedure similar to that used in Sec. 8 to prove the divergence 
theorem. We subdivide <r into k approximately planar elements of area 
Ao-i, each bounded by a simple contour C» (say triangular) (see Fig. 24). 
Then formula (10-5) with v replaced by and A by Aa^ when applied to 
the element bounded by C* yields 


curl v{Pi) A<Ti == / v*dr + c,* 

JCi 


( 11 - 2 ) 


On summing these expressions over the entire surface o* we get 

k k k 

5^ Of • curl v(P,) A<r,- = 53 / v-dr + 53 

4=«I »w.l 


(11-3) 


But the line integrals in (11-3) when summed over the common bound¬ 
aries of adjacent elements cancel out, since such boundaries are trav¬ 
ersed twice in opposite directions. The surviving terms yield the line in¬ 
tegral J^v*dr over the boundary C. If the number k of elements Acr,- 

is allowed to increase indefinitely, so that the greatest linear dimensions 
of the Aor^ tend to zero, the sum on the left becomes the surface integral 

/ n* curl V d<r. The sum €,• A<r» tends to zero as in the discussion of 

(8-3).^ Thus, formula (11-1) is correct. It should be noted that once a 
positive direction for the normal n has been agreed upon, the positive 
direction of description of the contours C^, and hence of C, is determined 
by the right-hand-screw convention. 

* That is, if ail | «il are less than «, this sum is less than where 8 is the area of a-. 
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If ir is s closed surface, the sum of the line integrals over the contours 
Ci is zero, and in that event n- curl v dir « 0* 

We note further that if curl v « 0 in then L v*dr = 0 for an arbitrary 

fp ^ 

closed contour C. Hence the line integral I v*dr is independent of the 

JPo 

path and thus defines a function u(P), such that du = v*dr. We can 
show conversely^ that if the line integral in (11-1) vanishes for every 
closed path C in R, then curl v = 0 throughout R. Reference to (10-9) 
shows that the condition curl v = 0 is identical with Eqs. (5-17) and 
(5-18), ensuring that V‘dr = du. 

Example: Evaluate / n • curl v dtr over the surface t -fVa* — z® — if 
J<r 

V M i2y — jx 4- Iw. 

The surface in this example is a hemisphere of radius a, and it is clear that 

. ^ ,y ^ 

r r T ^ 

_ /* 

where r * ix + jy 4* is the f)osition y 

vector for points on the hemisphere. We I ^ 

readily check that ^ 


dx dy dz 

I 2y —X z I 

Hence / n • curl y dcr ^ —3 / - d<r. 

J<r J<X a 

This integral can be easily evaluated by 
noting that (Fig. 25) da «* sec y dx dy^ 
where y is the angle between the normal 
n and the positive direction of the z axis 
(cf. Chap. 3, 8ec. 17). But from Fig. 25, 
sec y » sec d w a/z, so that 



5 jj dx dy 


since the region of integration A is a circle of radius o. The reader will check this result 
by taking da »» a* sin eded<t> as the element of area of the surface of the sphere in 
spherical coordinates. 

To obtain the result (11-4) from Stokes’s theorem (11-1) we compute / y*dr, where 

Jc 

C is the boundary of the circle x* 4- y® « o*. Since dr « i dx 4- j dy -b k we have 


/ v»dr *» I {2y dx -- X dy + z dz), 
Jc Jc 


1 See the corresponding discussion in Sec, 9, 
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But along C we have 0, and the equation of C may be taken in the form 


We thus get for (11-5) 



X » a cos 

y a mvi4>f 0 ^ < 2ir. 

2 «’ 

a^(2 sin* 4> f cos* 4>) d4> 



—3ira*. 


PROBLEMS 

1. Show that for the special case of a plane region bounded by a simple closed curve 
C, Stokes’s theorem reduces to Green's theorem (9-6). 

2. If V « iy -h + kic and a is the surface of the paraboloid * « 1 — x* — y*, 

> 0, compute / n • curl v d<r. 

r 

3. What is the value of the surface integral / n • curl vda if v » ~f- jxy 4- kxr 

J<r 

and <r is the hemisphere x* 4* y* + =* 1, 2 > 0? Evaluate this integral directly and 

by Stokes’s theorem. 

4. Compute / V‘dr if v « i(x* 4- y^) 4- 4” 2 ^) 4* ky and C is the circle x* 4- y* «« 

Jc 

4 in the plane « « 0. 

3. Prove that the area A of the plane region bounded by a simple closed curve C in 
the xy plane is given by 

^ = 2 - ydj) 

when C is described in the positive direction. Hmt: Use Green’s theorem ( 9 - 6 ). 

6 . Verify Stokes’s theorem if v * iy* 4- j^ry — krz and <r is the hemisphere z « 


X* - y*. 


ILLUSTRATIONS AND APPLICATIONS 


12. Solenoidal and Irrotational Fields. Let a continuously differentiable 
vector function v(P) be specified in a region R, If curl v = 0 at every 
point of Ry we say that v(P) is an irrotational vector field. If v(P) is such 
that div V = 0, the field is said to be solenoidal. The importance of 
solenoidal and irrotational vectors in applications derives from the fact 
that every continuously differentiable vector function v(P) defined in a 
regular simply connected region R can be expressed as the sum of two 
vector functions, one of which is solenoidal and the other irrotational. 
We do not prove this fact here because it depends on demonstrating the 
existence of solutions of certain partial differential equations,^ and it would 
carry us too far in the study of potential theory. Accordingly, we limit 
our discussion to proofs of two basic theorems concerned with solenoidal 
and irrotational vector fields. 

See Prob. 6, A discussion of the system of equations in question is contained in 
M. Mason and W, Weiiver, *The Electromagnetic Field/^ pp. 352-^5, University of 
Chicago Press, Chicago, 1932. 
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Theokem I, A necessary and sufficient condition (hod a <mUnuomly 
differentiable vector v{P) be irrotational in a simply connected regular region 
R is that V «= Vu, where u is a single-valued scalar function with continuous 
second derivatives. 

We suppose, first, that v = Vw; then 


curl V * curl Vu =« 


i 

j 

k 

a 

a 

a 

dx 

dy 

dz 

du 

du 

du 

dx 


Tz 



as follows at once on expanding the determinant and noting the equality 
of the mixed partial derivatives of u(x,y,z). 

Conversely, if we suppose that curl v = 0 in P, then it follows from the 
concluding paragraph of Sec. 11 that du == v*dr and hence v =» Vu. 

Theorem II. The continuously differentiable vector function v(P) is 
solenoidal in a region bounded by a regular surface if, and only if, it is equal 
to the curl of some vector w with continuous second derivatives. 

Let us suppose, first, that v « curl w. Then 


div V = div curl w = 0, 


as follows from a simple calculation ^ making use of formulas (7-5) and 
(10-9). Conversely, if div v = 0, we show that a vector w can be con¬ 
structed such that V = curl w. It suffices to show that the system of 
equations curl w — v, or 

dw,i dWy 

dy dz 


dWx dWg 

dz dx 




( 12 - 1 ) 


dWy dWx 

dx dy 


has a solution for w^, 


Wy, Wg whenever 

dVx dVy dvg 
dx dy dz 


0 . 


(12-2) 


^ See Prob. 16, Sec. 10. 
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We dlow how to construct one such soluticm in rectangular domains. 
If we take te* 0, then the second and third of Eqs. (12-1) require that 


BWz 




= -Vu(x,y^), ^ = »,(»,y,2)- (12-3) 

Bx Bx 

On integrating (12<^) with respect to x and treating y and z as constants, 
we get 

tB,--! Vy(x,y,z) dx -h 

•'XO 

(12-4) 

w„== f v,{x,y,z) dx + \f>(.y,z), 

JXQ 

where 0 and ^ are arbitrary differentiable functions of y and z. If we 
insert these solutions in the first of Eqs. (12-1), we get 


But from (12-2) 
so that (12-5) yields 


■'aroVay Bz/ By 


Bz 


(12-5) 


Bvy Bvj. Bv^ 

By Bz Bx 


Jx 


'X BVx B<l> B^ 

dx + — ~ — 

^0 Bx By Bz 

B<f> Byp 

vx(x,y,z) - v^(xo,y,z) +- -- 

By Bz 


( 12 - 6 ) 


(12-7) 


This equation can be satisfied by taking ^ 0 and 

<t>{y,z) = / v^{xo,y,z) dy. 

*'1/0 

Thus, one solution of the system (12-1) and (12-2) is 

Wx *= 0, 

Wy = r Vx(x,y,z) dx, 

</xo 

w, = - / Vy{x,y,z) dx+ r Vx{xo,y,z) dy. 

JxQ ■'yo 

The proof clearly indicates that w is not unique. Indeed, if we take w 
with components given by (12-7) and add to it Vu, where u is an arbitrary 
scalar function with continuous second derivatives, then 

curl (w + Vu) « curl w 

feasmuch as curl Vu m 0,^ 

* Conversely, if curl wi « v, then curl (wi — w) -■ 0 and wi — w •» Vw by Theorem 
I. Thus every solution wj is representable in the form w 4- where w is the par¬ 
ticular solution found in the text. 
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We remark in conclusion that whenever the divergence and curl of a 
vector function v are specified in the interior of a regular simply connected 
region and the normal component of v is known over the surface bounding 
the region, then there is just one vector function v satisfying these condi¬ 
tions. This uniqueness theorem is important in many applications. The 
reader may prove it by following suggestions given in Prob. 7 below. 

PROBLEMS 

1 . Show that V «» i2xyz + }xh -f kx^y is irrotational, and find u(x^y,z) such that 

V »» Vu. 

2. Show that v * i(« ~ j/) -h j(jJ — 2 ) + k(y — x) is solenoidal, and find w(x,y, 2 ) 
such that V « curl w. 

а. Is V » i(j/* -f 2 x 2 * — 1) -f j2xy + k2x*s irrotational? If so, find u such that 

V Vu. 

4. Is V « i(x ®2 — 2xyz) + }(xy — Bx^yz) -f k(yz^ — xz) solenoidal? If so, find a w 
such that V » curl w, 

б . Prove that v « r”r, where r ix -p j|/ -h h 2 , is irrotational. Is it solenoidal? 

6 . Let w «* u + V, where u is irrotational and v solenoidal in a given suitably re¬ 
stricted region R, Then there exists a vector q such that v »» curl q and a scalar 4> 
such that u =» V<t>, Show that <i> and q satisfy the following partial differential equations: 

« div w, V div q — V*q ■* curl w. 

7. If V is a continuously differentiable vector function defined in a regular simply 
connected region R bounded by the surface a and if 

curl V - f(x,i/, 2 ), div V «• g(x,y,z) 

in R and v^n « h(x,y,z) on cr, show that v is uniquely determined in R by these con¬ 
ditions. 

Outline of the Solution. Assume that there are two such vectors, v « vi and v V 2 . 
With w =» vi — V 2 , show that there is a w such that w «= Vw, and deduce » 0. 

By applying the divergence theorem to the vector wVw, show that / (Vu)-(Vu) dr 0. 

Jk 

Since (Vu) - (Vu) > 0, this integral can vanish only if Vu s 0. 

13. 'Gradient, Divergence, and Curl in Orthogonal Curvilinear Coordi¬ 
nates. In this section we record the expressions for the gradient, diver¬ 
gence, curl, and I^aplacian in orthogonal curvilinear coordinates. These 
can be obtained from the definitions (7-3), (8-7), and (10-2) in a manner 
BO similar to that used to obtain formulas valid in cartesian coordinates 
that we dispense with the details of calculations. 

As in Sec. 1, we suppose that a transformation 

Vi =* yiixi,X2,xz), i == 1 , 2 , 3 , 

wherein the variables yi are cartesian, defines a curvilinear coordinate 
system x. We suppose that the coordinates Xi are orthogonal so that the 
quadratic differential form (2-13) has the structure 

(ds)^ * gnidxi)^ + (722(dxj)* + gzzidxn)^. 
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We denote the unit base vectors along the Xi coordinate lines by ei, 62 , ea 
and represent a vector v(P) in the form 

V »= ei^i + e2t>2 + esVQ, (13-1) 

The volume element dr formed by the coordinate surfaces x, = const 

and Xt 4- dx^ - const (Fig. 26) has 
the shape of a rectangular pamllel- 
epiped with edges ^ dst = \/ dx^, 
Hence the areas da-ij of its fac(*s are 

da 12 = Vgiig 22 dxi dx 2 , 

dan = dxi dx^, (13-2) 

da2z = Vgr22{733 dx2 dx^, 
and its volume dr is 



dr = VgiiSf22g33 dxi dx 2 dx 3 , (13-3) 

To compute div v we calculate the flux j^n*v da over the surface of the 

volume element dr and divide it by its volume (13-3). A calculation like 
that performed in Sec. 7 yields the result 


1 [^(t’lVis) d{v2hihs) 

div V =--1-;-h 

h\h2h^ L dX| dx2 


d(vshih2y 
dx3 . 


(13-4) 


where ^ V^. 

A similar but slightly longer computation also yields the formula 

, 1 [^(/isva) d(M 2 )l 

curl V = ei-- 

A 3/12 L dx2 J 


r s(hivi) 

dihsVzY 

1 

+ ©3 7~7~ 

d(h2V2) 

a(/nri)l 

L dXs 

dxi _ 

hih2 

_ dxi 

dX2 J 


(13-5) 


which can be written more compactly as 


curl v = 


1 


Aie, 

h2^2 

^3®3 

d 

d 

d 

dxi 

dX2 

dX3 

hivi 

^2*^2 



(13-6) 


' See Sec. 2. 
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Finally, the formula for the gradient of a scalar w(xi,X 2 ,X 3 ), as follows 
from (8-7), is ^ 


Vu 


ei du ^ 

h'^ dX}^ dXg h'^ dxg 


(13-7) 


Inasmuch as div Vu = V^u, it is easy to check that the substitution 
of V = Vw in (13-4) yields 




1 


^ /^2^3 d /hih^ 

dJi \ hi dxi/ dx2 V h2 



(13-8) 

In cylindrical coordinates defined by the transformation 

X = r cos 
y = r sin 
2 = 2, 

the metric coeflScients are ^ 

{7ll = 1, ^22 = fi^33 = 1, 

so that = 1 , ^2 == ?*, h-A = 1 . 

Accordingly, formulas (13-4) and (13-8) yield 

1 (9(nv) 1 ^ve dvg 

div V =-1-1-» 

r dr r 00 Oz 



1 d‘^u 


d^u 


where v = iiCr + BxVo + kr* 

ri, 01 , k being unit vectors in the direction of increasing r, 6^ and z (Fig. 3), 
In spherical coordinates determined by 

X = p sin ^ cos <l>y 

y = p sin ^ sin </>, 

2 = p cos 0, 

1 Henceforth wo shall use the symbol Vu to mean grad u in curvilinear coordinates as 
well as in cartesian. 

* See See. 2, 
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hi « 1, Aji « p, Aa « p sin 0, as follows from Prob. 2 in Sec, 2. On making 
use of (13-4) and (13-S) we find that in spherical coordinates 


div V 


1 d(p%) 1 d{sin$ve) 1 dv^ 


dp 


p sin d dB 


+ 


p sin B d<t> 


V^u 


I \ dp/ 1 \ dB/ 

^ ^ -^ ^ -^ 


1 


d^U 


where 


dp p^ sin 6 dB p^ sin^ B dtf>^ 

V = PlVfi + diV0 + 

and pi, 8i, are the unit vectors in the direction of increasing coordinate 
lines shown in Fig. 4. 

PROBLEMS 


1 . Write out the expressions for Vu in spherical and cylindrical coordinates. 

2 . What is the form of V* in parabolic coordinates (u,for which* 

(dsy - (u^ -h i^)l(du)^ + (dv)^} 4- uh\d4?‘> 

8 . The force P per unit charge due to a dipole of constant strength p is 
F »■ ri(2p cos Ofr^) -f- 0i(p sin d/r*), 
where r, 9 are polar coordinates. Compute div F and curl F. 

14. Conservative Force Fields. In the concluding sections of this chap¬ 
ter we illustrate the use of vector analysis in the treatment of several 
problems drawn from mechanics, hydrodynamics, and the theory of heat 
flow in solids. 

When a particle of matter is displaced along a path C in a given field 
of force F, the work W expended in moving it is determined by the integral 

W=j¥-di. (14-1) 

The integral (14-1), in general, will have different values for different 
paths joining the same two pointe in the force field. If (14-1) is independent 
of the path, the field F is said to be conservative. 

We show next that the force field determined by Newton's inverse-square 
law of attraction is conservative.^ According to Newton's law a particle 
of mass m located at a point P is acted on by a force F whose magnitude 
is proportional to m and inversely proportional to the square of the distance 
r from P to the center of attraction 0. Thus, 

km 

F« ri, (14-2) 

* A shnilar discussion applies to electrostatic force fields determined by Coulomb’s 
Jaw, idnce the mathematical structures of Newton’s and Coulomb’s laws are identical. 
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where is the unit vector directed from 0 to P. The positive constant 
k is determined experimentally; it clearly depends on the choice of units 
of measure of F. Physically the law (14-2) represents the force of attraction 
of the mass m at F by a unit mass located at 0. 

If we rewrite (14-2) in the form 

km 

F--—r, (1«) 


where r = rri, and insert it in the work integral (14-1), we get for tlie 
work done in displacing the particle from Pi to P 2 along the path C, 

f km 

W » -g-r*dr. (14-4) 

But r*dr =« ~ rdr, so that we can write (14-4) as 



The integral (14-5) is clearly independent of the path joining Fj and ^ 2 , 
and if we denote r(F 2 ) by r 2 and r(Fi) by ti (Fig. 27), we can write 



The function u{P) in (14-6) is con¬ 
tinuous at all points except when r = 0, and since div Vu = we readily 
find that the gravitational potential (14-6) satisfies Laplace's equation 


= 0 

except when r = 0. 

The gravitational potential at a point F due to a continuous distribu¬ 
tion of mass of density p is defined by the integral 

/ kp dr 

u{P) « /-- (14-8) 

jf f 

where r is the distance from the element of mass dm ^ p dr to the point F. 
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The force of attraction of the unit mass located at P by the body is 
determined by the formula F « Vw. 

The study of the properties of the scalar function u{P) defined by 
(14-8) is in the province of potential theory, and we shall encounter it once 
more in Chap. 6. 

Example: Let us compute the gravitational potential u(P) of a thin homogeneous 
spherical shell of radius o at a point P whose distance from the center of the shell is R 
(Fig. 28). 



Fig. 28 


The potential at P can be computed by summing potentials of the ring-shaped ele¬ 
ments of matter bounded by the cones with the somivertical angles 6 and 0 dB. The 
area of the zone inUjrcepted by these cones is 2wa sin 0 a dO, so that 


uiP) « f 
Jo 


kp2ird^ sin & dO 


where p is the mass per unit area of the shell. 
From the cosine law of trigonometry 


and we can write (14-9) os 


Va' + - 2aR ( 


u(P) — 2 xki>a^ f 

Jo V a“ H 


-1- 2aR cos B 


if i? > a, 


'[V{R-^a? 


^(a-/?)*), ifR;<a. 


If P is outside the shell, R > a, and we have the result 

Airkpa^ kM 


(14-10) 


where M •• 4iraV is the mass of the shell. 
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When P is inside the shell, R < a, and we get 

u{P) « 4i‘rkpa, (14-11) 

a constant. 

The result (14-10) can be stated as a theorem. 

Thkouem. The potential {and hence the force of attraction F « Vu) produced by a thin 
spherical shell at a po'inl exterior to the shell is the same as if the mass of the shell were can* 
centraied at its center. 

Tlie poU'ntial due to a solid sphere of constant density p at a point outside the sphere 
can be deduced at once from (14-10) by supposing the sphere to consist of thin concen¬ 
tric shells We conclude that this potential has the same form as (14-10) with M re¬ 
placed by the mass of tlu' sphere. Accordingly, the force of attraction produced by a 
solid homogeneous BjiUere on a unit mass at a point P outside the sphere has the mag¬ 
nitude kM/R^. This force is directed toward the center of the sphere 

From (14-11) we see that the force of attraction at a point inside the shell is zero. 

The integral (14-8) becomesJmprofier if P is witliin the solid, for in that case, r «» 
'n/(j ~ + (y ~~ rif -f (2 — becomes zero when tiic intx»gration variables 

coinride with the coordinaWs (x,i/,z) of P. However, the concepts of potential and gravi¬ 
tational attraction can be shown t,o have a meaning even when P us a point in the 
interior of a homogeneous solid.* 


16. Steady Flow of Fluids. Let C be a curve in the xy plane over which 
a sheet of homogeneous fluid of 
depth 1 is flowing. 'The linos of flow 
of the fluid particles are indicated in 
Fig. 29 by curved arrouvs, and \vc 
suppose that the flow pattern is 
identical in all planes parallel to 
the xy plane. A flow of tins sort is 
called two-dimensional. 

The problem is to determine the 
amount of fluid that crosses C per 
unit time. We denote by v the ve¬ 
locity of the fluid particles at a point 
P on C and compute the volume dV Fig, 29 

of fluid crossing an elemtut dt of C 

per unit time. Since the depth of the fluid is 1, this volume is equal to the 
volume of the parallelepiped 

dV = k'V X d\f 

where k is the unit vector perpendicular to the xy plane. The volume V 
crossing C per unit time, tlierefore, is 

V — f k*v X dr. 

Jc 

' See in this connection Sec, 20, Chap. 1, and I. S. Sokolnikoff, ‘Tensor Analysis,” 
sec. 89, John Wiley & Sons, Inc., New York, 1951, where it is shown that the potential 
u satisfies Poisson's equation « — 4irp. 
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But by Cliap. 4, Eq. (6^2), 


k*v X dt 


0 0 1 
Vx Vp 0 
dx dy 0 


Vx dy — Vy dXf 


since v » iv* + jvy and dt ^ idx + ) dy. 
Accordingly, 


V 


(Px dy - Vy dx). 


(15-1) 


If C is a closed curve and the fluid is incompressible, the net amount 
of fluid crossing C is zero, because as much fluid enters the region bounded 
by C as leaves it. Thus a steady flow of an incompressible fluid is char¬ 
acterized by the equation 

{vxdy — Vydx) « 0, (15-2) 


where the integral is evaluated over any closed curve C not enclosing the 
points at which the fluid is generated or absorbed. But Eq. (15-2) implies 
that —Vy dx + Vx dy is an exact differential d^{x^y) of the function 


Moreover,^ 


^{x,y) = 


J (xo.yo) 


{—Vydx 4- Vxdy). 


dx 


Vyj 






and Vx and Vy satisfy the condition 

d{—Vy) dVx 

dy dx 


(15-3) 

(15-4) 


(15-5) 


throughout the region R in which (15-2) holds. Equation (15-5) is a con¬ 
sequence of (15-2); it states, in effect, that there is no fluid created or 
destroyed in the region R. For this reason it is called the equation of 
continuity. Since 

dVx dVy 

div V «-1-» 

dx dy 

we can write (15-5) in vector form as 

div V « 0, (15-6) 

which is consistent with the meaning attached to the symbol div v in 
Sec. 7, 


iSeeSec.5. 
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The function defined by (16-3) is the stream function, and the 

tracks of the particles of fluid, or streamlines, are determined by the equa¬ 
tion '^{x,y) =• const. The velocity field satisfying (16-6), we recall, is 
said to be solenoidal. If the flow v is irrotational, then curl v == 0 and 
there exists a scalar function $(x,y) such that ‘ 


(16-7) 

( 16 ^) 

The function 9{x,y) determined by the integral 

rix,y) rP 

^(z,y) « / (v^dx + Vydy) ^ / v^dr 

'(aftt.ifo) JPo 




d^ 

d^ 

—, 

i; as- 

dx 

dy 


is called the velocity potential because of the relations (15-8). We emphasize 
the fact that the condition for the existence of ^(x^y) is 


or in scalar form 


curl V a= 0 

dw* dVy 

dy dx 


(15-9) 


If the flow is both irrotational and solenoidal, the relations (15-4) and 
(15-8) hold and we conclude that 


— « —» — -in K. (15-10) 

dx dy dy dx 

These are the celebrated Cauchy-Riemann equations which we shall en¬ 
counter again in Chap. 7. 

Furthermore, if div v = 0 and v is given by (15-7), we see that 

div = VH = 0. (15-11) 

Thus, the velocity potential 4> satisfies Laplace^s equation tliroughout any 
region containing no sources or sinks. 

On differentiating the first of Eqs. (15-10) with respect to y and the 
second with respect to x and on equating di^^/dx dy to d^^/dy dx, we find 
that the stream function ^(x,y) also satisfies the equation 

= 0 . 

The practical importance of these results is stressed in Chap. 7, Secs. 19 
to 21. 

The foregoing considerations can be extended to the three-dimensional 
flows as indicated in Sec. 17. 


> See Sec. 12. 
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PROBLEMS 

L Show that the gravitational field determined by (14-2) is both solenoidal and 
irrotationi^l except at (0,0,0). 

2. Show that the velocity field 

. _ y _ 

^ -f ^ 

is solenoidal in any region which does not contain the origin (0,0). Is it irrotational? 
Verify that the velocity potential » log r « log (r^ + \h and the stream function 

«* tan~^ {yjx) « B, Compute the circulation around a circular path enclosing the 
origin, and thus obtain a physical interpretation of results in Probs. 6 and 6 of Sec. 5 
and Prob. 3 of Sec. 9. 

2. Discuss a two-dimensional flow for which the velocity potential <l> « cx. What is 
the stream function W for this flow? Plot the curves * const and 4^ = const. 

4- Discuss a two-dimeiLsional flow for which the stream function i.s 4' « 2xy. Find 
the velocity potential ‘I', and sketch the curves = const and 4' = const. 

5. If V and w are irrotational vector fields, show that v x w is solenoidal. 

6. Show that the streamlines are orthogonal to the lines 4» » const. 

7. Show that when the three-dimensional flow v is irrotational, the streamlines satisfy 
the equations 

dx dy dz 

Vx % Vx 

8. If the velocity potential of the two-dimensional flow is — y*, find v and 

obtain the equations of the streamlines. Is this flow solenoidaP Is it irrotational? 

9. Show with the aid of the Cauchy-Riemann equations that when the stream func¬ 
tion 4f(x,y) is given, the velocity potential is determined by 


4>(x,y) 


rfx,y) ^4^ , \ 


10. Use the result given in the preceding problem to calculate 4>(i,y) if (a) 4^ = — 

Zzy^, (b) 'F « -y/{x^ 4 .y^), x 5*^ 0, y 3*^ 0. 


16. Equation of Heat Flow, The following derivation of the Fourier 
equation of heat flow illustrates admirably the use of the divergence 
tlieorem in mathematical physics. 

It is known from empirical results that heat will flow from points at 
higher temperatures to those at lower temperatures. At any point the 
rate of decrease of temperature varies with the direction, and it is generally 
assumed that the amount of heat AH crossing an element of surface Aa 
in At sec is proportional to the greatest rate of decrease of the temperature 
u; that is, 

du 

AH « h Aff At — 
dn 


Define the vector q, representing the flow of heat, by the formula 

q »= -* Vu, (16-j) 
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where fc is a constant of proportionality known as the thermal conductivity 
of a substance. [The units of fc are cal/(cm-sec ^^C).] The negative sign 
is chosen in the definition because heat flows from points of higher tem¬ 
perature to those of lower, and the vector Vu is directed normally to the 
level sxirface u = const in the direction of increasing u. 

Then the total amount of heat H flowing out in M sec from an arbitrary 
volume T bounded by a closed surface <r is 


f du 
tf fc — d<r 


i j q-n da, 


since q^n ~ —fc du/dn by (16-1). 

On the other hand, the amount of heat lost by the body r can be cal¬ 
culated as follows: In order to increase the temperature of a volume ele¬ 
ment by one must supply an amount of heat that is proportional to 
the increase in temperature and to the mass of the volume element. Hence 

du 

AH == c Au p At ^ c — At p At, 
dt 

where c is the specific heat of the substance [cal/(g ®C)] and p is its density. 
Therefore, the total loss of heat from the volume t in At sec is 


f du 

[ I — cp dr. 

Jr at 


Equating (lG-2) and (16-3) gives 


f q-nda ~ f 

Ja Jr 


Applying the divergence theorem to the left-hand member of (16-4) yields 


f f du 

/ div q dr = — / — cp dr, 
Jr Jr at 


and since q = —kVu, the foregoing equation assumes the form 


div ( —fcVw) + Cp — dr 
dti 


Now, if fc is a constant, 
and (16-6) becomes 


div (fcVw) = fcV^w 


-fcV^tt-f cp-™)dr sO. 


( 16 - 6 ) 
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Since this integral must vanisli for an arbitrary volume r and the integrand 
is a continuous function^ it follows that the integrand must be equal to 
zero, for if such were not the case, r could be so chosen as to be a region 
throughout which the integrand has constant sign. But if the integrand 
had one sign throughout this region, then the integral would have the same 
sign and would not vanish as required by (16-0). 

Therefore, 

— kV^u + cp — « 0 
dt 


or 

where 


dll 

— « h^V\ (16-7) 

dt 

cp 


Equation (16-7) was developed by Fourier in 1822 and is of basic impor¬ 
tance in the study of heat conduction in solids. A similar equation occurs 
in the study of current flow in conductors and in problems dealing with 
diffusion in liquids and gases. 

It follows from (16-7) that a steady distribution of temperatures is 
characterized by the solution of ivaplace’s equation 

Vh - 0 . 


It was assumed in this derivation that the body is free from sources and 
sinks. If there are sources of heat continuously distributed within r, 
tlien it is necessary to add to the right-hand member of (16-3) the integral 

jj{x,y,z,t) dr, 

where fix^y^z^t) is a function representing the strengths of the sources. 
The reader will show that in this case one is led to the equation 

— = h^V'^u + —. 

dt Cp 

provided that the thermal conductivity of the substance is constant. 
Thus the presence of sources leads to a nonhomogeneous partial differential 
equation. 

17. Equations of Hydrodynamics. Consider a region of space containing 
a fluid, and let v denote the velocity of a typical particle of the fluid. 
Tho amount Q of fluid crossing an arbitrary closed surface a drawn in 
the region can be calculated by determining the flow across a typical 
element A<t of the surface <r. A particle of fluid is displaced in At sec 
through a distance v At, and since only the component of the vector v 
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normal to the element Atr contributes to the flow across tliis element, the 
amount AQ of the fluid crossing A<r is 

AQ ^ pv-n A<r At, 


where p is the density of the fluid (Fig. 30). 

The entire amount Q of fluid flowing out of the volume r, which is 
bounded by a, in At sec is 


Q ^ At f pv-nd<r. 

J<r 

On the other hand, the quantity of the 
fluid originally contained in r will have 
diminished by the amount 




for the change in mass in At sec is nearly equal to (dp/dt) At At, and the 
negative sign is taken because p is a decreasing function of t. 

Equating these two expressions for Q gives 


/. 


pv*n d<T 



(17-1) 


and the application 
this equation yields 


or 


of the divergence theorem to the left-hand member of 



Since the integrand is continuous and the volume r is arbitrary, one can 
conclude that 

dp 

— + div (pv) « 0. (17-2) 


This is the basic equation of hydrodynamics, known as the equation of 
continuity. It merely expresses the law of conservation of matter. 

It has been assumed that there are no sources or sinks within the region 
occupied by the fluid. If matter is created at the rate kp(x,y,z,t), then the 
right-hand member of (17-1) should include a term that accounts for the 
increase of mass per second due to such sources, namely, 

j kpdr. 
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dt 


+ div (pv) 


kp. 


The constant of proportionality k is sometimes called the growth factor. 

The density p(x,y,Zyt) of the fluid at the location {x,y,z) of the fluid 
particle depends on t explicitly and on x^y.z implicitly, since the particle 
coordinates change with time as the particle is displaced. Thus, 


dp dp dp dx dp dy dp dz 
dt dt dx dt dy dt dz dt 


(17-3) 


In this equation, dp/di means the rate of change of density as one moves 
with the fluid, whereas dp/dt is the rate of change of density at a fixed 
point. 

Upon noting that 


dx dy dz 

i-h j-h k — 

dt dt dt 


dp ^ dp dp 

and Vp = i-(-j-hk —♦ 

dx dy dz 

we can write the formula (17-3) as 


dp 

dt 


dp 

-h V* Vp. 

dt 


Substituting from 


(17-2) in (17-4) gives 
dp 

— = — div (pv) + v-Vp. 
dt 


(17-4) 


(17-5) 


But div (pv) = v*Vp + p div v (see Prob. 36, Sec. 7), so that (17-5) 
becomes 


dp 

dt 


—p div V, 


Idp 

or divv= -- (17^) 

p at 


It is clear from (17-6) that div v is equal to the relative rate of change of 
the density p at any point of the fluid. Therefore, if the fluid is incompres- 
sibte^ the vdocity field is characterized by the equation 

div V » 0. (174) 

If the flow of fluid is irrotational, then curl v » 0, and one is assured 
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V = V#. 


Substituting this in (17-7) gives the differential equation to be satisfied 
by f-, namely, 

3*4- d*4> 


„ 3*# 3-‘4- 

^- 1 -L 

dx^ dz^ 


0 . 


(17-8) 


The function 4> is called the velocity potential. A similar result was obtained 
in Sec. 15 for the two-dimensional flow. 

If the fluid is ideal, that is, such that the force due to pressure on any 
surface element is always directed nonnally to that surface element, one 
can easily derive Euler’s equations of hydrodynamics. Denote the pressure 
at any point of the fluid by p; then the force acting on a surface element 
Aar is — pn A<r, and the resultant force acting on an arbitrary closed surface 
a is 



The negative sign is chosen because the force due to presKSure acts in the 
direction of the interior normal, whereas n denotes the unit exterior normal. 

Let the body force, per unit mass, acting on the masses contained 
within the region r be F; then the resultant of the body forces is 

flpdr. 

Hence, the resultant R of the body and surface forces is 


R =*= / Fp dr *— / pndar 

Jr Jff 

^ j Ypdr — j Vp djy 


(17-9) 


where the last step is obtained by making use of (8-6). 

From Newton’s law of motion, the resultant force is equal to 

f dh 

dr, (17-10) 

dr 


where r « Lc + j]?/ + kz is the position vector of the maases relative to 
Hie origin of cartesian coordinates. It follows from (17-9) and (17-10) that 

|(rp-Vp-p^)dr-0, 

and since the volume element is arbitrary and the integrand is continuous, 
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Ip - Vp. 


(17-11) 


This is the desired equation in vector form, and it is basic in hydro- and 
aerodynamical applications. 

In books on hydrodynamics, the cartesian components of the velocity 
vector dx/dt are usually denoted by u, v, and so that 

dr , dx. dy dz 

^ lu + }v + kw ^ i -hi- 

dt dt dt dt 

Since n, v, and w are functions of the coordinates of the point {x^y^z) and 
[>f the time it follows tiiat 

/Bu du dz Bu dy Bu dz\ 

[~ + -+-?. +-) 

\Bt Bx dt By dt Bz dt/ 

+ i 


if 


i 


<; 


Bv Bv dx Bv dy Bv dz\ 

..|- 1 - 1 - 1 

Bi Bx dt By dt Bz dU 


( Bw Bw dx Bw dy Bw dz 

-j-1-1— 

Bt Bx dt By dt Bz dt 


)■ 


Bu Bu Bu 

Bu 

1 dp 

- 1 - uH - V + 

— w ^ Fx — 

— 

Bt Bx By 

Bz 

p Bx 

Bv Bv Bv 

Bv 

1 Bp 

- 1 - uH -*; + 

^w ^ Fy- 

- , 

Bt Bx By 

Bz 

p sy 

Bw Bw Bw 

Bw 

1 dp 

- 1 --v + 

— w ^ Ft — 

-- 

Bt Bx By 

Bz 

p Bz 


Substituting this expression in (17-11) and setting F - iFx + Wv + kF* 
ead to three scalar equations, which are associated with the name of Euler: 


(17-12) 


It is possible to show with the aid of these equations (and by making 
K)me simplifying assumptions) that the propagation of sound is governed 
ipproximately by the wave equation 

2r,2 

a/2 ■ “ *• 

[n this equation, a is the velocity of sound and «is related to tiie density 
) of the medium by the formula 

Po 

vhere po is the density of the medium at rest. 
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Equations containing partial derivatives arise in many branches of 
mathematical physics. Fluid flow, heat transfer, wave motion, electro¬ 
magnetic theory, elasticity, quantum mechanics, nuclear physics, and 
meteorology are but a few of the fields that involve a study of such equa¬ 
tions. In this chapter w'e give representative examples, indicating some 
of the more important methods of solution. In contrast to the theory of 
ordinary differential eciuations, it will be seen that now the general solution 
is seldom sought. The main problem, rather, is to find that particular 
solution which satisfies the determinative conditions (the so-called initial 
values and boundary values) of the specific problem in hand. 

THE VIBRATING STRING 

1. Arbitrary Functions: One-dimensional Waves. A partial differential 
equation of order n is an equation containing partial derivatives of order n 
but no higher derivatives. For example, each of the three equations 

df ^ dx^ dt ^ dx^ dz^ dy^ dz^ ^ 

is a partial differential eqiiation of order 2. In this chapter we shall often 
use the subscript notation for derivatives, so that the foregoing expressions 
can be written more briefly as 

Un = a^Uxxj ~ 01 Uxxy ^xx ^yy d” ^ (Ff) 

A function u that satisfies a given partial differential equation is called 
a solution of the equation. For example, the function 

u = cos X cos at (1-2) 

is a solution of tlie first Eq. (1-1), because (1-2) gives 

Ux ^ — sin X cos atj Ut « cos x(—a sin a<), 

Uxx «* — cos X cos atf Uu «= cos x(—cos at) * 

425 
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ITie reader will recall that the general solution of an ordinary differential 
equation contains arbitrary constants; for example, the general solution 
of y'' + ^ 0 is 

^ » Cj sin a: + C 2 cos x, 

which has the arbitrary constants ci and C 2 . We shall see that many im¬ 
portant partial differential equations have solutions which contain arbitrary 
functions and, conversely, the elimination of arbitrary functions from a 
given expression often leads to a partial differential equation. 

As an illustration of this fact let 


u - fix -h y), (1-3) 

where / is an arbitrary differentiable function. If the argument of / is 
denoted by « « x + ?/»then 

u = /(ar + y) = fis) 

and the chain rule ' gives 

du df ds df 
ox ds dx ds 


Similarly, Uy ~ /X®)? and hence u satisfies 

Uj “ Uy 

for any and all choices of the differentiable function /. 
Conversely, let u(x,y) be a solution of (1-4). If we set s =»* a: -f* t/, then 
u(x,y) u(x, s — j) sa 


The chain rule gives 


Us » Ux — + h i 
dx 


ds 

dx 


r. -f u. 


( 1 - 4 ) 


and, similarly, Uy » f/,. Substituting into (1-4) we get 

f/x -f u. - r/„ 

which shows that Ux ** 0. It follows that U is a function of « only, 

U ^f(s) -fy), 

and hence the same is true of u. Thus, (1-3) follows from (1-4). 

For an example containing two arbitrary functions, let 

U =» /i(r) + / 2 (s), /i and /2 differentiable, (1-5) 

wh^ r and s are the independent variables. Then Ur = /i(r), and hence 

IJ 0 . ( 1 - 6 ) 

*The reader may find it advisable to review Chap. 3, Sec. 4. 
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(The reader can verify that also U„ - 0.) Conversely, from (1-6) we have 

^(f/,) =0 

ds 

so that Ur is independent of s: 

Ur — h{r), a function of r only. (1-7) 

If we write/i(r) = jh(r) dr, then Eq. (1-7) yields 

-/»(»•)] = 0. 

dr 

so that U —/i(r) =/ 2 (s), a function of s only. Thus, U has the form 
(1-5). 

An important example of the elimination of arbitrary functions arises 
from the situation shown in Fig. 1. If t is time, it is seen that fi{x — at) 



Fio I 


represents a wave form which propagates in the positive x direction with 
velocity a and with no change in shape, that is, with no dispersion. In a 
similar manner, f^ix + ai) represents a wave form which propagates in 
the opposite direction with velocity a. llie most general one-dimensional 
wave without dispersion is a superposition of two such, namely, 

u = fi(x — at) + f 2 {x + at), (1-8) 

Suppose, now, that u(x,t) is given by (1-8), with fi and /2 twice dif¬ 
ferentiable. If we set 

X -- at = X + at ^ 8, (1-9) 


then u = /i(r) -f /2(s), and by the chain rule 

Hfi+f2)dr difi+f2)d8 

-1-= 

dx dr dx ds dx 


fi(x — aO +f 2 (x + at). 


The reader may verify similarly that 

M( =■ /i(x - aO(-a) +f 2 {x -b at){a). 
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Differentiatmg again givra 

«a» ” flix — oi) +fiix + at), 

Utt =/iV — a0(—a)* +/2(x + o<)(o)*, 

and hratce u satisfies the partial differential equation 

utt = a^Uxz- (1-10) 

We show conversely that every solution u{x,i) of (1-10) has the form (1-8) and thus 
represents the superposition of two waves propagating with velocity a. The substitu¬ 
tion (1-0) gives 

u{x,t) « U(r,s) 

so that, by using the chain rule as in the previous discussion, 

** f/r -f U*, ut ** —aVr + 

Differentiating again 5rield8 


Uxa5 C/fT "f" -f* f/j*, 

Uu « aHJrr - 2aK\. -{- a^f/a,. 

If we substitute these values into (1-10) we get (1-6). As we have already seen, this 
ensures that IJ has the form (1-5), and hence u has the form (1-8). 

Equation (1-10) is satisfied by the most general one-dimensional wave 
motion with velocity a; and conversely, everj^ solution of (1-10) represents 
such a motion. For this reason (1-10) is called the wave equation. To¬ 
gether with its analogues in two and three dimensions, (1-10) is an impor¬ 
tant aid in the study of many vibration phenomena. 

Example: Standing Waves. The motion given by 

fi(x --at) « A sin k(x — of), A, k const, (1-11) 

represents a sine wave of amplitude A and wavelength X « 2r/k, moving to the right 
with velocity a. The period T is the time required for the wave to progress a distance 
equal to one wavelength, so that X « aT or 



Similarly, a motion described by 

Mx -h at) « A sin k{x 4- at) (1-12) 

represents a sine wave, of the same amplitude and period, moving with velocity a to 
the left. The superposition of (1-11) and (1-12) gives 


%"hioh becomes 


tt » A sin fc(a: — at) -f A sin k(x -f at) 
u "• (2A cos kat) sin kz 


(M3) 



429 


S®c. 1] THIS VIBRATINa STHINO 

when w© recall the trigonometric identities 

sin k(x dtzoi) sin kx coBkai :k coQ kx sin kat* 

The ©xpnjssion (1-13) may be regarded as a sinusoid sin kx whose amplitude 2A cos kat 
varies with the time (in a simply harmonic manner. Several curves of (1-13) are sketched 
in Fig. 2 for various values of The points nv/k remain fixed throughout the motion 



and arc called ruxie^. Although the result was obtained by superposing two traveling 
waves, the wave form (1-13) does not appear to travel either to the left or the right, 
and (1-13) is said to represent a standing wave 

The number / of oscillations or cydes made by the wave per unit time is called the 
frequency. From the definition of the period T, it follows that / = 1/T. 


PROBLEMS 

1. If w fiyfx) with / differentiable, show that 

xuj -h yiiy =» 0 for X 0. 

2. Show by direct differentiation that u « sin kx sm kat satisfies the one-dimensional 
w'ave equation for every choice of the constant k, and express tins function in the form 
( 1 - 8 ). 

3. (a) By computing w,, and Uxy obtain a second-order partial differential equation 
for u =* f\{x)f%{y). (h) Show that your result us equivalent to (log ii)«y ** 0, and explain, 

4. For many functions ilie chain rule applies even when the argument is complex. 
Assuming this, show that 

u « fi(x -h iy) -f/ 2 (x - ly), ^ « -1, 

satisfies Laplace's equation * 0. 

6. liCt /(x -j- iy) « u(x,//) 4- t^(x,y), w^here u and v are rt'al. Using the chain rule, 
show that u and v siitisfy the Cauchy-Rieinann equations 

Uz « Vy^ Uy ^ 

6. Show that u « f{ay — Px) satisfies 

atUx 4” «« 0 

if / is differentiable and a, ^ are constant. 
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The Linear Eqtiatioa with Constant Coefficients 

7. The operators Z>3 and Dy are defined by 

dx^ * dy” 

and we agree also, for example, that 

(aDx 4" 0Dy)U ® aDxU 4" 0OyU ^ CXUx 4" 0Uy- 

(а) If mi are constant use the result of Prob. 6 to solve each of the equations 

(Dx — rniDy)u ** 0 , (Dx — m2Dy)u «* 0 , 

(б) Show that both solutions obtained in (a) satisfy 

{Dx ~ miDy){Dx ~ m2Dy)u « 0 (1-14) 

Hini: Since nii are constant, ICq, (1-14) may also be writt/cn 
(Dx — m2Dy){Dx — miDy)u = 0. 

(c) Deduce that a solution of (1-14), containing two arbitrary functions, is 

14 « Fi(i/ 4~ miT) 4- 4“ mx), mi 9 ^ m 2 , 

u « Fiiy 4* mix) 4- xF2(y 4- mix), mi * m 2 . 

Hint: Since the equation is linear and homogeneous, the sum of the two solu¬ 
tions in (h) is again a solution. The result for mi «= m 2 nmy be verified by 
direct substitution. 

{Similar results hold in general. The solution of 

(Dx — mDyYu 0 

can be shown to be 

u Fi(y 4- mx) 4- xF2{y 4- mx) -f . . . >f x^-^Fr(y -f- mx) 

and the solution for several such factors is obtained by addition (cf. Chap. 1, Sec. 21). 
The process gives the “general solution” in that the number of arbitrary functions 
equals the order of the equation.] 

8. The fourth-order equation 

d*n d*u df*u 

dx^ Ox* 0 /y* dy* ^ 

occurs in the study of elastic plates. Show that the general solution is 

u - Fiiy - ix) 4- xF2(y - ix) -f- Fz{y 4- ix) 4 - xFi(y 4 - ix). 

Hint: The equation may be written 

(Dt 4- 2DlDl 4 - Di)u 0 
so that the decomposition into linear factors gives 

(Dx 4- iDy)(Dx 4- tDy)(Dx - iDy)(Dx - iDy)u - 0. 

Us© the result of Prob. 7. 
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9. As in Probs. 7 and 8, solve; 

(a) « 0; (6) -f « 2ww; 

(0 Wxz + ** Oi (d) 4“ Wyi/ ** 2Wxy. 

10. Consider the equation 

Uxx 4- 4uxy - « f{x,y). 

(o) By the method of Prob, 7 obtain a general solution when / » 0. 

(6) By assuming u «« cy'*, where c is a constant to be determined, obtain a par¬ 
ticular solution when J ^ y^. 

(c) Similarly, obtain a particular solution when / « 

(d) By addition of the results (a), (?>), (c) obtain the general solution when / *» 
2/2 4- X. 

11. As in Prob. 10, obtain the general solution; (a) 2zxx 4- * 1J W «xx 

« x^; (c) Zxx 4- 32x1, 4 2zyy «» T + y. 

2. Derivation of a Differential Equation. Consider a flexible, elastic 
string stretched between two supports on the rr axis (Fig. 3). To obtain 



a difTerential equation for the motion, let v{x,i) represent tlie vertical 
distance from the point x on the x axis to the string at time f. We shall 
apply Newton's law, 

(Mass) (acceleration of center of mass) =* force, (2-1) 

to the short piece of string between x and x 4* Ax. 

The mass of the short piece is 

Mass = p As 

where ? is the mean density and A^v the length. The vertical component 
of acceleration for the short piece is 

Vertical acceleration ~ —- 
dt^ 


if u is the height of the center of mass above the x axis. To compute the 
vertical component of force we let T be the tension, and we introduce the 
angle 6 between the tension vector and the x axis. By Fig. 3 the vertical 
component is 


Vertical force due to tension *= {T sin S) 


- (T sin $) . 
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If there is an additional vertical force Fi(x,t) 6x due to other causes, 
substituting into (2-1) yields 

P As— ^ (T sin e) — (T sin &) -f Fi(x,t) Ax. 
dr X 

Upon dividing by Ax and letting Ax —> 0 we get 

ds d^u d 

- —{T sin^) + Fi{xfy (2-2) 

dx dr dx 


if the required derivatives are continuous. 

To obtain a simpler equation, note that the definition of arc length 
yields ^ 

ds f 


and also * 


dx 


1 


sin d = tan d(l + tan^ d) ^ = Ux{l + ul) ^ ^ Ux, 


if Moreover, if the displacement u is small, we can consider 

T « const. Substituting into (2-2) yields the approximate equation 


putt == Tuxx + Fi(x,t). 

This in turn may be written 

utt == a^Uxx + Fix,t), (2-3) 

where a «*= VT/p and F{Xft) == p'~*Fi(x,0. 

Equation (2-3) will be considered in the sequel under the assumption that 
p, and hence a, is constant. 

When the force function F(x,0 is zero, the vibrations of the string are 
termed free vibrations. By (2-3) the equation for free vibrations is 

Utt = a^Uxx (2-4) 


and hence the solution has the form (1-8). According to the discussion 
in Sec. 1, the motion can always be regarded as a superposition of two waves 
moving with velocity 


a = 



( 2 - 6 ) 


* The symbol ^ (read “is asymptotic to") means that the ratio of the two sides tends 
to 1, A discussion of this useful notation is given in Chap. 1, Sec. 2. 

•The fact that the string is flexible means that the tension vector is tangent to the 
dtring, so that 

dx* 


tan 0 slope of curve 
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in opposite directions. Later we shall determine the precise form of these 
waves by considering the initial state of the string, that is, the state at 
t » 0, together with the conditions at the end points, x ^ 0 and x I 
Inasmuch as the constant a in (2-4) involves only the ratio T/p, two 
strings may beliave similarly even if made of different materials. For 
example, a string with density 2p under tension 2T behaves like a string 
with density p and tension jT, since both yield the same value for a. An 
(^uivalence of two different physical systems such as this is sometimes 
called a principle of similitude. 


The study of similitude belongs to an interesting branch of mathematical physics 
known as dimemicmol analysis. Although a general development ^ will not be given 
here, we shall describe the underlying idea as it applies to (2-5). 

Equation (2-5) rehites three quantities a, T, and p which are expressed in different 
physical units, lu the mks system * 


f meters*] 

["kilograms "I 

^ r kilogram-meters! 

L second J' 

^ L meter J * 

L (second)® J 


where the square bracket is used to indicate that the measuring unit, rather than the 
value, is being describ<jd. The value of a for use in (2-5) is the number of such measur¬ 
ing units, that is, the number of meters per second, and similarly for p and T. 

If we decide to measure lengths in centimeters rather than in meters, then the value 
of a will be increased by a factor 100. In other words, 100a cm per sec is the same as 
a m per sec. Similarly T will be multiplied by 100, but p will be divided by 100, since 
the length unit for p in (2-6) occurs in the denominator. (Indeed, p kg per m is cleai ly 
the same as 0.0Ip kg per cm ) Hence w'hen a string has a wave velocity a, density p, 
and tension T in the old system (2-6), then the same string has velocity, density, and 
tension 

lOOo, loor (2-7) 


in the new system. Substituting into (2-5) yields 

Imr 

100a ■■ \ - 7 TT- 

V p/100 

which is consistent with (2-5), as it should be. One does not get a contradictory result 
by measuring all lengtiis in centimeters rather than in meters. 

When we change meters into centimeters, we divide the unit of length by 100. More 
genially, one might divide the unit by an arbitrary positive constant a. The new 
values of a, p, and T would be, respectively, 


oa, aT, (2-8) 

a 

(compare (2-7)]. Similar changes may be made in the units of mass or of time. £qua- 

‘ The reader is referred i>o P. W. Bridgman, ‘‘Dimensional Analysis,” Yale University 
Press, New Haven, Conn., 1931, and S. Drobot, The FoundaUons of DimenBiiMial 
Analysis, Studia Maih.^ 14:84-99 (1954). 

*The mks units for P can be found from Newton’s law (2-1), since P is a force. 
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tim (2^) remains self-consistent under such changes, as the reader can verify* The 
question arises: Is (2-5) the only functional relationship 

a«^f(p,T) (2-9) 

which is consistent under such changes? If so, then we would have a proof of the func¬ 
tional relation (2-5), assuming merely that there m a functional relation of some kind. 

To investigate tins possibility, suppose ( 2 - 9 ) holds where / is an unknown function 
and where a, T, and p stand for the numbers of their respective units of measurement in 
(2-6). If the unit of length is divided by a, then (2-8) gives 


aa 



upon substitution into (2-0). Since a is arbitrary we may choose « « p to find 


pa«/(l,pT). (2-10) 

If we now divide the unit of mass by /3, the value of a is unchanged but p and T be¬ 
come ^ and /JT, respectively [see (2-6)]. Substituting into ( 2 - 10 ) yields 

/Spa -/(l,/3^pr). 

Upon choosing /S* « (pT)"“\ we get 

(pr)-V -/(1,1), 


so that 



(2-11) 


where c /(1,1) is a consUini, independent of a, p, and T. 

Finally, if we divide the unit of time by 7 , the new values of a, p, and T are given by 
( 2 - 6 ) as a/ 7 , p, and T/y^, Substituting into ( 2 - 11 ) gives 


a 

y 



which reduces to (2-11) again. Thus, no new information is obtained by changing the 
unit of time, and the constant c in ( 2 - 11 ) cannot be found by dimensional analysis. 
But we can determine c by considering the limiting case of small oscillations. The 
partial differential equation (2-4) is then valid, and (2-5) shows that c « 1 . 


PROBLEMS 

1. The displacement of a certain string is 

w(a :,0 * fi(x - at) -f/ 2 (a: -f at). 

What is the physical meaning of the condition u( 0,0 » 0 ? If w(0,0 » 0, express fi in 
terms of / 2 , and thus deduce 

u(x,t) « fi(x -f at) - f 2 i-x -f at). 

% (a) Tindf(x - at) when/(a;) « (1 -f when/(a:) « sin kx; whmf(x) * c* In 
aach tfase compute also f'(x — at). HifUi Substitute a; — of for x in the expressions for 
/(x), ( 6 ) If u(x,t) «/(x - at) -f/(x 4 * at), find w(0,f), u(x,0), and w(^,l/a) for each/(x) 
in part (a) of this probleip. 
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B. In the derivation of (2-3) w© observed that sin 0 but we used this result in 
the form (sin (?)* (u*)«. (a) By differentiating the exact formula 

sin » Wic(l -f 

with respect to x, show that (sin 0)x ^ Wxx is correct provided Vxz is bounded. Also 
show that the error is of the order of wj in this case. (6) By c^cmsidering w « 1 -b a;, 
» » 1 + 2x near x *« 0, show that the equation u does not always enable us to 
conclude w* ^ t>z- (In otlu^r words: If two functions approximate each other the de¬ 
rivatives need not approximate each other and a separate investigation must be given,) 
4. Show that the small longitudinal vibrations of a uniform long rod satisfy the dif¬ 
ferential equation 

dt^ * p dx*' 


where u is the displacement of a point originally at a distance x from the end of the rod, 
E is the modulus of elasticity, and p is the density. Hint: From the definition of Young’s 
modulus E, the force on a cros.s-8ectional area q at a distance x units from the end of 
the rod is Eq(du/dx)^ since du/dx is the extension per unit length. On the other hand, 
the force on an element of the rod of length Ax is pq Ax d^il/dt^. 

6 . If the rod of Prob. 4 is made of steel for which E — 22 X 10® g per cm^ and whose 
Sj^cific gravity is 7.8, show that the velocity of propagation of sound in steel is nearly 
5.3 X 10® cm per sec, which is about sixt-een times as great as the velocity of sound in 
air. Note that in the cgs system E must be expressed in dynes per square centimeter. 

6 . Show that the differential equation of the transverse vibrations of an elastic rod 
carrying a load of p(x) lb per unit length is 


El 


dx' 


p(x) - m 


d^y 


where E = modulus of elasticitj’^ 

/ = moment of inertia of cross-sectional area of rod about a horizontal transverse 
axis through center of gravity 
m = mass per umt. length 

Ilini: For small deflections the bending moment M about a horizontal transverse axis 
at a distance x from the end of the rod is given by the Euler formula M « El d^y/dx^, 
and the shearing load p(x) is given by (fMjdx^ = ;>(x) 

3. Initial Conditions. In the previous section the wave equation 

Utt = a^Uxxy CL = const, (3-1) 

was derived for small displacements of a uniform flexible string. Ac¬ 
cording to Sec. 1 the general solution of (3-1) is 

u{x,t) ^ fiix — at) + f^ix + at) (3-2) 

where fi and /g are arbitrary twice-differentiable functions.^ We shall 

^ Actually, (3-2) is meaningful whenever/i and ft are well defined, and hence, condi¬ 
tions of differentiability are not emphasized in the sequel. A nondifferentiable function 
(such as the function shown in Fig. 4) is regarded as lieing a ‘^solution” of (3-1) if it can 
be approximated, with arbitrarj*^ precision, by smooth solutions. See: I. G. Petrovsky, 
“Partial Differential Equations,” p. 65, Cambridge University Press, New York, 1954. 
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QQW see that these functicme can be determined from the initial c(Hiditi<>nB, 
that is, from the conditions at time < >= 0. It is convenient to regard the 
string as infinite and the conditions as given for — « < a: < «, The 
effect of the end points a: = 0 and x = I will be considered in Sec. 5. 



Fio. 4, Ordinates on the resultant wave are obtained by forming one-half the sum 
of the oppositely moving waves shown by the dashed lines. 


Case I. Initial Impulse 0. Assume that the string is released from rest 
and that the initial stiape is given by a known function/(x). (Such a sit¬ 
uation arises when the string is plucked, as in a harpsichord.) In symbols, 

u(x,0) = fix), «((x,0) = 0, (3-3) 

where the second Eq. (3-3) expre-sses the fact that the vertical velocity 
du/dt is initially 0 for each point x of the string. By (3-2) we get 

utix,i) = —afiix — at) + oji(x -f at) (3-4) 

upon usdng the chain rule as in Sec. 1. Since u«(x,0) = 0, Eq. (3-4) gives 

/1(X)»/2(X) 

after dividing by a. It follows that/ 2 (x) =* /i(x) -1- c, where c is constant. 
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Using this equality with x replaced by x + ai we see that (3-2) may be 
written 

u{x,t) « /i(x - at) +fi(x + at) + c, (3-6) 

This step is sometimes puzzling when encountered for the first time; namelyi from 

Mx) « fi(x) -f- C 

how can we deduce f 2 (x ^ at) » /i(x -j- at) c? The conclusion follows because the 
first equation holds for all values of x (and the conclusion would not follow otherwise). 
One cannot simply set x •» x -f at, because that would lead to of 0. But one can 
reason as follows: We have /ifx) *= /i(x) 4* c for all x. Hence / 2 ( 8 ) *« /i(a) 4* c for all 
8, and the choice s =* x 4 - yields the desired result. 

So far we have used only the second initial condition (3-3). To ensure 
the first condition, u{xfi) = f{x)f we set ^ = 0 in (3-6) and equate the result 
to/(x'), thus: 

fi(x)+fi{x) + c^f{x). 

It follows that fi(x) == }if{x) — He, and substituting into (3-5) gives 
the final answer: 

w(j,0 « H/(^ - + Hf(^ + oO- (3-6) 

The displacement ?^(.T,f) in (3-6) is the sum of two waves, each of the 
form which travel in opposite directions with the velocity a. In¬ 

itially (that is, for t = 0) these weaves coincide, but with the passage of 
time they diverge, the wave ^' 2 /(x — at) moving to the right and the other 
to the left. In particular, if the waves are of finite extent, then any given 
point of the string is at rest in the initial position aft^r the passage of both 
waves. The situation is illustrated schematically in Fig. 4 when /(x) is a 
triangular wave on 

Casb: II. Initial Displacement 0. Suppose, next, that the initial dis¬ 
placement is 0 but that the initial velocity is not 0. (Such a situation 
arises when the string is struck, as in a piano.) If the initial velocity is 
g{x) at point x of the string, the initial conditions are now 

u{x,0) « 0, w,(x,0) = g{x), (3-7) 

The first Eq. (3-7) gives 

/i(^) +/2(^) = 0, 

when we recall (3-2), so that / 2 (x) = -fi(x) for all values of x. Using 
this equality with x replaced by x + at, we see that (3-2) may be written 

u(x,t) == /i(x - at) - /i(x -f- at). (3-8) 

Differentiating (3-8) with respect to t and setting t » 0 yield 

w<(aj,0) = -afiix) - afi(x) - g{x) 
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when we use the second condition (3-7). It follows that 

fM - --I fg(s)ds + c, (3-8) 

2a •'0 

where c is constant, and hence (3-8) gives the final answer 

1 1 rx^at 

u{x,t) = - — ds + -- / g{s) ds. 

2a ■'0 2a ‘'o 

The result may be expressed more compactly as 

I /x + af 

u(x,i) = — / g(s) ds. (3-10) 

2a 

Equation (3-8), like (3-6), represents a superposition of two waves 
traveling in opposite directions. Here, however, the siiapes of the waves 

are determined by/i(x) and ~-/i(r), 
w)ii(d) are mirror images of each 
other in the x axis. Moreover, the 
shapes are not found directly by the 
initial condition but are obtained 
through the integration (3-9). For 
this reason the waves may be of infi¬ 
nite extent even when the initial 
impulse Ut(x,0) = g(x) is confined to 
a finite portion —k<x<koi the 
string. Indeed, for such a choice of 
g(x) formula (3-10) shows that any 
given point x of the string eventually 
suffers a permanent displacement 

l\g{s)ds. (3-11) 

This is the case because when 
at > A' +1X1, the interval (x — atj 
X 4- at) contains the interval {—k,k). 
Inasmuch as g{x) — 0 outside the 
interval (—A;,A:), the integral (3-10) 
is then equal to (3-11). Since each 
given point of the string eventually 
moves the same distance (3-11), the 
part of the string that is again at rest forms a straight line parallel to the 
original string. It is most interesting that this happens regardless of the 
dwice of g(x)j provided only that g{x) ~ 0 outside some finite interval. 
Graphical illustration is given in Fig. 5 for the case g(x) =« 1 on (~ifc,jfc). 
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Case III. Arbitrary Initial C<fnditions, Suppose, now, that both the 
initial displacement and the initial velocity are given by arbitrary functions 
of X, so that tlie initial conditions are 

u{xfi) == fix), utixfi) =* gix). (3-12) 

This problem can be solved by superposition of the two solutions previously 
obtained. Indeed, let v{x,t) and wix,t) satisfy the wave equation (3-1) 
and the respective initial conditions 

= fix), Vtixfi) = 0, 
wixfi) = 0, wtixfi) = gix). 

Then tiie function 

uix,() = vix,t) + wix,t) 

satisfies the wave equation because v and w do, and addition of the relations 
(3-13) shows that t/ satisfies (3-12). Since the wave equation was solved 
in the previous discussion subject to initial conditions of the type (3-13), 
addition of the two solutions obtained formerly gives the solution desired 
now. That is, 

1 1 1 rz-\-at 

'^ixJ) = -fix - at) + - fix + at) + — / gis) ds. (3-15) 

2 2 2a 

The expression (3-15) is known as d'Alembcrf a formula; it satisfies (3-1) 
and (3-12), hence gives the motion of a string subjected to arbitrary initial 
displacements and velocities. 

The formula (3-15) can also be used to find the displai*ement of a semi- 
infinite string (0 < X < co) fixed at x = 0. If the initial displacement and 
velocity of a semi-infinite string are 

w(x,0) = /(x), wt(x,0) = ^(x), X > 0, (3-16) 

w^e can imagine an infinite string for which the initial conditions in the 
interval (0 ,qo) coincide with (3-16) and in the interval ( — 00 , 0 ) are deter¬ 
mined by 

w(x,0) == ~/(lxi), Wf(x,0) = -“^?(|xl), X < 0. (3-17) 

The point x = 0 of an infinite string, moving in accord with (3-16) and 
(3-17), will obviously be at rest, and the behavior of the infinite string for 
X > 0 will be identical with that of the semi-infinite string. 

The superposition method does yield a solution of the problem but does not establish 
the uniqueness of tlmt solution* We shall now show that every solution of (3-1) and 
(3-12) can bo represented in the form v -f w, with t? and w as in (3-13). Since v and w 
were idready shown to be unique, it w*Ul follow that u is also unique. 


(3-13) 

(3-14) 
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Indeed^ let u(x,t) be a solution of the wave equation (3*1) which Satisfies the initial 
conditions (3-12). Let v(Xft) be the unique solution of (3-1) satisfying the first condi¬ 
tions (3-13). Then the function u)(xfi defined by 

w(Xft) » w(x,0 — v(x,0 

satisfies (3-1) and the second set of initial conditions (3-13), as the reader can verify. 
It follows that w is uniquely determined and hence u(x,t) is also uniquely determined. 
Because of uniqueness, (3-15) describes the hehamor of the string Without uniqueness, 
we could only say that (3-15) describes a ‘possible behavior of the string, 

PROBLEMS 

L The displacement of a siring is given by the traveling wave 

u(x,0 *» sin (x — at). 

What are the initial displacement and velocity? Verify, by actual substitution into 
(3-15)* that your initial values yield the correct re.sult, u{x,t) « sin (x — at). 

2. For a freely vibrating string the initial displacement and velocity are, respectively, 
sin X and cos 2x. Find the displacement and velocity of the point x *» 0 when 1 tt. 
Hint: First find w(x,0 from (3-15). 

3. A fretdy vibrating string was subjected to an initial displacement 6 cos 5x and 
initial velocity 0. One second later it is found that the point x « 0 is displaced thret‘ 
units from the equilibrium position; that is, w(0,l) « 3. What can you say about the 
velocity of propagation for waves on this string? 

4. The initial velocity of a freely vibrating string is xc""*“. For what choice of the 
initial displacement (if any) does the re.suUing motion represent a traveling wave travel¬ 
ing in the positive x direction? Hint: It is desired that u{x,i) « fi(x — at). Determine 
fi from the initial velocity, and then determine the initial displacement from/]. 

5. Holve Prob. 4 with the words ^^velocity'* and ^Misplacement'' interchanged. 

6. A stretched infinite string is struck so that its segmiait — 1 < x < I is given an 
initial velocity 1. Use (3-15) to find the displacement and sketch the displacement 
curves for t ^ 1/a and i » 2/a. 

7. The initial displacement and velocity of a semi-infinite string are w(x,0) »• sin x, 
%(x,0) » 0, 0 < X < w. Find nix,t) for i > 0. Also find u{xfi if w(x,0) * 0, Wi(x,0) « 
—2a cos X, 0 < X < 00. 

4* Characteristics. A physical interpretation may be given not only by 
plotting u(x,t) versus x for a succession of value.s of t, but also by consider¬ 
ing the xt plane. Each point of the xt plane represents a definite position 
on the string at a definite time t. If w'e take f = 0 to be the present time, 
then the half planCvS t < 0 and i > 0 give the past ^ and future, respectively. 

Since the speed of propagation is a, the disturbance at (x,0 will reach 
a point (xo,to) given by 

^ Although it is not appropriate to {)crmit t <0 when the string is plucked or struck 
at f »• 0, it is appropriate if the string has been in motion for some time and the initial 
conditions are determined by high-speed photography. We could then take the view¬ 
point that we are trying to ascertain the past history of the string by observations on 
tktti present. 
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X Xo X -- Xfi 

- - m a or -» —a, 

t-k t- to 


(4-1) 


for the direct wave /i(x — at) and the opposite wave f^ix + at), respec¬ 
tively. Equations (4-1) may be written 


X — at ^ Xo — atoy x + at ^ xo + ato- (4-2) 

If we draw the two lines (4-2) through the point (xo,^), as shown in Fig. 6, 
their intersection with the x axis 
(that is, t - 0) gives those points on 
the string for which the initial con¬ 
dition contributes to the disturb¬ 
ance at (xo,/<)). The lines (4-2) are 
called the chanKierutics of the par¬ 
tial differential equation (3-1). 

Along the first line (4-2), x — at 
is constant, and hence /i (x — at) is 
constant. Thus, the deflection due 
to the direct wave is the same at all points of the first characteristic (4-2). 
I'he second line serves the same purpose for the opposite wave, and we 
can say, briefly, that the disturbance travels along the characteristics. 

If the initial disturbance is confined to some interval (xi,r 2 ), theoi we 
have the situation shown in Fig. 7. The xt plane is divided by the charac¬ 




teristics into six regions. In region 1 the points receive the disturbance 
from both waves, in II only from the opposite wave, and in III only from 
the direct wave. The points in IV and V are too far away to receive any 
disturbance at the corresponding times, and the points in VI are at rest 
because both waves have paased. That is, if P is a point in the region 
VI, then tlie characteristics through P (showm dashed in the figure) in¬ 
tersect the X axis outside the interval (xi,X 2 ). Hence the initial displace¬ 
ment at these points is zero, and we need consider the initial impulse only. 
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Since the characteristics intersect outside the interval (^ 1 , 3 - 2 )^ the dis¬ 
placement at P due to the initial impulse is given by the (constant value 
(3-U). 

We have seen that the initial conditions determine botli the direct wave 
and the opposite wave at each point on the x axis wliere these condit/ions 
are given. Since the disturbance propagates along the characteristics 
the following theorem is suggested: 

Theorem I. Let u and ut be given on ike interval 8, and 

suppose utt = (Puxjc, Then u(Xji) is 
uniquely determined in the shaded 
region but is not^ uniquely determined 
at any other point. 

Both the initial displacement and 
the initial velocity have to he speci¬ 
fied in Theorem just as one would 
expect intuitively. It is a remark¬ 
able fa{‘t that the displacement alone 
(v^ithout the velocity) >\ill deter¬ 
mine the solution, provided this dis¬ 
placement is given along two intersecting characteristics in the xt plane. 
Indeed, let u{xd) be given along (xi,P) in Fig 8. Since the direct wave 
fi(x — at) is constant on we can ascertain the shape of the reverse 

wave f 2 (x + at) along (xi,P). This, in turn, gives/ 2 (x + at) along (xi,X 2 ), 
because the disturbance f 2 {x + at) propagates from (.rj,.r 2 ) to {xi,P) along 
the characteristics parallel to {X 2 ,P) (see the dashed line in tlie figure). 
In just the same way, when u(x,t) is given on ( 32 ,/^), we can determine 
the shape of the direct wave/i(3: — at) on ixi,X 2 ). Tims, we are led to the 
following theorem: 

'I'heorem II. Let u he specified along the two intersecting characteristics 
{xi^P) and (x 2 ^P) in Fig. 8, and suppose that Uft = a^Uxx- Then u{xd) is 
uniquely determined in the shaded region hut is not uniquely determined at 
any other point. 

Theorems I and II are the fundamental existence and uniqueness theorem 
for the wave equation, deduced here by physical considerations. A simple 
mathematical proof of the same results is given in Sec. 25. 


Fig, 8 


6* Boundary Conditions. We now suppose that the freely vibrating 
string is not infinite but is stretched between two points of support (Fig. 
3). When the supports are on the x axis and do not move, the situation 
is described by 

r(0,0 « 0, u{lfy ^ 0 for all L (5-1) 

These are called boundary conditions^ because they refer to the boundary 
points of the interval (0,f) in winch our pb3n3ical problem is defined. Al- 
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SEC. 5] 


uniquely, they do enable us to establish some of the most interesting and 
important properties of the motion. Hence, in this section we see what 
can be deduced from (5-1) alone. In the next section we use (5-1) together 


with appropriate initial conditions. 

Physically, one would expect the 
until the disturbance created by the 
ends reaches the point of observa¬ 
tion. In terms of Fig. 7, the ends 
X == 0 and x = I have no effect in 
the region I provided the points 
a* = 0 and x = / lie outside the 
interval (.^ 1 ,^ 2 )- Wiien the disturb¬ 
ance reaches an end point, however, 
it is reflected, and the reflected 
wave must eventually be taken into 
account. 

Because the end point is fixed the 
incident and reflected weaves have 
algebraic sum 0 at the end point, 
and hence there is a 180'^ phase shift. 
A wave of type / 2 hr + at) becomes 
a w^ave of type ~-/ 2 (—.r + at) upon 
reflection at x == 0, for example (see 
Fig. 9). The change of sign m /2 
expresses the phase shift, and the 
change of sign in x indicates that 
the reflected wave 

g(x - at) ^ -f2(-x -f at) 


string to act like an infinite string 



Fig 1) 


propagates in the opposite direction. 

When the wave is reflei'ted again at x = I, we g(‘t another minus sign 
in each case, and hence the original wave foix + at) is restored (Fig. 9). 
Since the velocity is a and the length of the round-trip path is 2Z, the time 
for a round trip is 

21 

Period of vibration = (5-2) 

a 


In terms of/ 2 (af) the periodicity condition means that 

/2(tt/) = /2 l^a ^ j = f2{at + 21). 

Similar remarks apply to /i(t), and hence we expect that both /i(x) and 
/ 2 (x) will be periodic functions ^ with period 2L 

^ A function fix) has period p if f(x -f p) /(x), where p is a nonzero constant. 
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To discuss the boundary conditions mattematically, let us think of 
the finite string as being in reality an infinite string which vibrates in such 
a way that the points a; « 0 and x ^ I remain fixed. The formula 

u{x,t) ^ fi{x — at) +f 2 {x + at) (5-3) 

holds for all solutions of the wave equation. Letting x = 0 gives 

0 -/i(-aO +f 2 (at) 

when we use the first boundary condition u(0,t) == 0. This shows that 
f^is) ar —/i( —s) for all s, and hence (5-3) becomes 

u(x,i) = fi(x ~ at) - at). (5-4) 

Thus the effect of the boundary condition at x — 0 is to reduce the number 
of arbitrary functions from two to one. 

The second boundary condition applied to (5-4) gives 

0 ^h{l-at) ^fii-^l-at) 
or, if we set s « — at, 

0^Ms + 2l) - Ms). (5-5) 

Since i is arbitrary, so is s, and hence fi (x) has px^riod 21. (This agre<^s 
with the surmise we had formed on physical grounds.) In view of (5-4), 
we can summarize our result as follows: 

Theorem I. Suppose an in finite string vibrates freely in such a way ttiat 
the points x ^ 0 and x = I remain fixed. Then the dispkcement u{xd) is 
periodic both in space and in time. The two periods arv^ respectively, 21 
for X and 21/a for t if a is the lyelocity of propagation. 

Hence if a string is stretched between two fixed p^oints, the free vibrations 
are p)eriodic no matter what Ifw initial conditions may be. Since a pxTiodic 
vibration is generally pcrceiv^cd as musical, tliis fact is of great importance 
for the development of musical instruments. 

Theorem I asserts that the motion wull rcp)eat after a time 2l/a. Heni^e 
if the minimum t pieriod of a vibrating string is determined by observation, 
that minimum pieriod will not be longer than 2l/a. It may be shorter, 
however. For instance, the function 

u(x,0 = sin 27rx/l cos 2vat/l 

satisfies (5-1) and the wave equation, hence represents free vibrations of a 
string of length 1. But the minimum t p>eriod of this function is l/a rather 
than 2l/a. The shorter p)eriod is explained by the fact that ^ 0; 

tjiat is, the center of the string is a node. The center does not move, 
and the string acts like two strings of length 1/2 placed end to end. We 
shall now show that there is always at least one node if the pxjriod is smaller 
than that given by Theorem I. 
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Thboebm IL // the string considered in Theorem 1 has an z period 2p 
or t period 2p/a, wh^re 0 < p < then the point x » p must he a node. 

Suppose, first, that the x period is 2p. Then, in particular, 


u(p,t) «. u(-p,l). 

On the other hand (6-4) givee u(x,t) «■ —u(—i,t); hence 


u(p,l) - -u(-p,t). 

(6-7) 


By addition of (5'-6) and (5-7) we get u(p,t) « 0, which shows that x « p is a node. 
Suppose, next, that the i period is 2p/a. The equation 



combines with (5-4) to give 

fi(x - at - 2p) - /i( - ai - 2p) « Mx - at) - /i( -x - at). 

Tf we let X 4* of “ 0 and x — of — 2p »« «, the equation reduces, after rearrangement, 
to 

fi(8 -}~2p) -/i(s) = c, (5-8) 

where r = /i(0) — M — 2p) is constant. Equation (5-8) shows that /i(a) increases by 
the amount c whenever s increases by 2p. If c 5*^ 0, it follows that 1 / 1 ( 5 ) j is unbounded. 
However, /i(s) has period 2/ by (5-5), hence is hounded, and this shows that c « 0 in 
(5-8). The choiee « =* —p — of in (5-8) with c » 0 leads to the desired result: 

u(p,0 «*/i(p - at) at) « 0. 

To illustrate the use of Theorem II, suppose a 2-in.-diameter steel cable 
ICK) ft long is ol)servcd to vibrate without nodes at llic rate of two complete 
(‘>’cU»s per second. According to Theorem I the t period is 21/a or possibly 
less. But Theorem II shows that the period is not less, since the motion 
was observed to have no fixed points, lIon(‘e 

1 100 

- =:= 2 — 

2 a 

which gives a = 400 fps. This is the velocity with which waves are 
propagated along the cable. Since tlie density of steel is about 480 lb 
per fr^, the weight of I ft of cable is 

480?r(K2)^l - - iOlb per ft. 

This gives p = slug for the linear density, and hence the tension is 
T « a^p - (400)2(i;> 32) = 50,000 lb. 


PROBLEMS 

1. An infinite string vibrates frt^ely in such a way that the two points x •* 0 and 
X m I remain fixed; that is, a(0,0 *« u(i,0 * 0. Are any other points of the string neces¬ 
sarily fixed? Which ones? 
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% Suppose a freely vibrating string of length I has just one node at « p between 
0 and L Show that the node must be at the mid-point. (The analogous result forn 
nodes is also true.) Hint: If p < 1/2, apply Prob. 1 to the two points a: *» 0, a; « p. 
If p > 1/2, apply Frob. 1 to the two points ar » p, a; ^ 

3. A cable of length I ft is made of a material with density d lb per ft®. It is found 
that the cable makes 10 complete oscillations in t sec. Sliow that the cross-sectional 
stress is 

<r » 0.087d I>si 

provided the oscillations do not have a nodc3 between 0 and I, How would the result 
change if the mid-point remains fixed during the observed oscillations but no other 
point rt^mains fixed? 

4. I^et hit) be a given function of i. (a) What is the physical meaning of the boundary 
condition u{l,t) ** h{i)*f (b) Describe a physical problem that would lead to the boundary 
conditions u(ac,0) » 0, ulx,k(t)\ « 0. 

8. Initial and Boundary Conditions. We shall now consider the free 
vibrations of a string satisfying the boundary conditions 

n(fl,i) = 0, u{l,i) = 0 for all t, (G-1) 

together with the initial conditions 

u(xfi) = f(x), 'Utixfi) = g(x) ior 0 < X < L (G-2) 

As in the preceding section we regard the finite string as being an infinite 
string with nodes x — 0^ x — 1. According to (5-4) and (5-5), the lioundary 
conditions give 

v{x,t) = /i(x — at) ~-/i(—X — at) (6-3) 

where/i(x) has period 21, and conversely, (G-3) ensures (0-1). The initial 
conditions (G-2) are prescribed on (0,/) for the infinit-e string, and our task 
is to fissign initial conditions outside the interval (0,/) in such a way that 
the solution has the form (G-3j. 

Denoting the unknown initial conditions for the infinite string by fo(x) 
and go{x)f we have 

Mor) = fix), goix) = gix), 0 < z < I, (6-4) 

because the infinite string is to agree with the finite string on (0,Z). Upon 
setting ^ 0 in (6-3) we get 

foi^) =/iW -fii-x). 

Similarly, differentiating (6-3) with respect to t and putting i = 0 give 
go(x) ^ -af^(x) + afi(-x). 

These expressions show that * 

* A function 4>(x) even if j) ® 4>ix), odd if <f>{x) m —<t»(x). An analytical and 
graphical discussion of such functions is given in Chap. 2, Sec. 19. 
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( 6 ^ 5 ) 


/o(x) and g^ix) are odd functions. 

Hence, /o and are determined on (—Z,0 by their values on (0,Z). 

Finally, since /o(x) and go{x) are expressed in terms of the function/i(a:), 
which has period 2Z, we see that 

/o(x) and gfo(x) have period 2Z. (6-6) 

Thus, /o and gfo are known everywhere as soon as they are known on (-“Z,Z). 
According to (3-15), the solution is 

111 rx+ai 

« ~fo(x - at) + -fo(x + at) g^is) ds. (6-7) 

2 2 2a 

If /o(^) and go(x) in (6-7) are determined by (6-4) to (6-6), it is easily 
verified that this function u(x^t) satisfies the wave equation, the initial 
conditions (6-2), and the boundary conditions (6-1). Thus, (6-7) is a 
simple and explicit expression for the motion of a vibrating string with 
fixed end points. 

The oorresponclf'noe between the finite string and the ixifinite string leads to an 
interesting geometrical (‘onstruction for getting the disturbance at any point P of the 
strip 0 < r < / in the xt plane (Fig. 10). For the infinite string the disturbance at P is 
found by drawing characteristics as in Sec. 4 (see solid lines in Fig 10). Since the initial 



conditioas for the infinite string are obtained from those for the finite string by (6-4) to 
(6-C), the same result may be found by following the dashed lines in Fig. 10. To take 
account of (6-5), however, we must introduce a changed sign upon each reflection at the 
boundary. I'he disturbance at P arises from the initial disturbance at x'l and xi, sub¬ 
ject to the above-mentioned convention regarding sign. This reflection of the charac¬ 
teristics in the boundary linos x » 0, a; / is quite analogous to the reflection of waves 
at the end points of the string. 
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Thcr f>r<>o©<iure iltotrated ia Fig- 10 is an example of the method of images^ m called 
because the initial conditions for the infinite string are obtained from those for the 
finite string by forming repeated mirror images in the lines ^ 0, a; « 0, and a: «• f. 

Example: Biscua^ the free oscillations of a string of length I which satisfies the initial 
conditions 

7liric 

n(a;,0) /« sin -y-i Wt(x,0) » 0, 

where n is an integer and fn is constant. 

^nco sin rnrx/l is odd and has period 21, we may take 


/o(x) «« fn sin —» goix) « 0 


as initial conditions for the associatcjd infinite string. Equation (6-7) now yields the 
solution 


u(x,t) 


fn . nir(x ~ at) , fn nvix 4- at) 
— . — + _«„— . — 


. mrx nvai 

« fn sin — cos —y- 


(M) 


If the initial displacement is given by a Fourier series, so that * 


u(x,0) «= f(x) « 2/„ sin Ui(xfl) = 0 


then superposition of the corresponding solutions (6-8) yields 
u{x,t) 


. nirx nrat 
Zfn sm — cos ~~— 


(6-9) 


Similarly, by choosing /o(x) « 0, goix) « gn sin nvx/l in (6-7), the reader can verify 
that the solution satisfying 

tiTrx 

u{xfi) « 0, Ut(x,0) «= g(x) « Zgn sin • 


/ 


/ .X V 

u(x,t) « Z -.sm —“ sm — 

nira I I 


( 6 - 10 ) 


Superposition of (6-9) and (6-10) yields the general P'oorier-sfrm solution of the wave 
equation satisfying (t>-l) and (6-2). The result can be expressed explicitly in terms of 
f(x) and g{x) by means of the Euler-Fourier formulas 

/ 2 2 tlTTX 

fn « Y f{x) sm ~ dx, gn ^ -J g(x) sm — dx. 

Because of convergence questions the Fourier-series solution is somewhat less general 
than (6-7), and it is hopelessly inferior to (6-7) for numerical computation. But Fourier 
series have great usefulness in that they apply to many problems in which the preceding 

CO 

^ Throughout Chap. 6 we use 2) as an abbreviation for 21. A brief review of this 

ii-*! 

notation is given in Chap. 2, Sec. 2, and Fourier series are discussed in Chap. 2, 
Secs. 18 to 25. 
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methods fail. Examples are given for the vibrating string in the next section and for 
other physical systems in the sections to follow. 


PROBLEMS 


1. Show that the expression (6-S) satisfies the appropriate (a) differential equation, 
(b) initial conditions, (r) boundary conditions. 

2. The initial displacement of a freely vibrating string of length I is 

X ^ 1, 

fix) « 26 ~ <x <1, 


and the initial velocity is gix) * 0. (a) Sketch fix) and fo(x). (6) Using (6-7) and your 
sketch, find the di8plac<jment of the mid-point of the string when at ■* 1/4, 

3. (a) Express fix) in Prob. 2 as a Fourier sine senes. (6) By (a) and (6-9) show that 
the displacement of the string in Prob. 2 is 


uixf) 


86/1 irZ irai 1 . 3irX ZtcU 

„ (p y cos - - p «n — .OS — +... 


(c) Obtain an infinite-series representation for the displacement of the mid-point when 
(U • 1/4, 


7. Damped Oscillations. The foregoing discussion was concerned with 
free vibrations, so that F{x,t) == 0 in (2-3). It was indicated that the 
displacement u{x,t) is always periodic in time and hence the amplitude 
remains constant. But, in fact, the oscillations gradually die down when 
a string is vibrating in air, and this behavior is to be analyzed next. 

The reason for the decrease in amplitude is that the air resists the 
motion of an object moving through it. Wlien there is no relative velocity, 
there is no resistance; when there is high velocity, there is high resistance. 
If the resistance is assumed proportional to the velocity, we have 

F{x,t) « —2but{x,t)y 6 > 0, const, (7-1) 

in (2-3). The minus sign is used because the force resists the motion, 
hence is directed opposite to the velocity. Our partial differential equation 
is now 

uu — (i^Uxx = —2but (7-2) 

and the solutions of (7-2) for 6 > 0 represent the damped oscillations of 
the string. As before, one has the initial and boundary conditions 

u(x,0) « f(x)y ut(xfi) « g{x)y (7-3) 

u{0yt) « 0 , u{lyt) « 0 . ( 7 - 4 ) 

Equation (7-2) cannot be solved by the method of the preceding sections 
but can be solved by Fourier series. Thus, since the solution u(x,0 is a 
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twice-differentiable function of for each we may expand u{Xyi) in a 
Fourier sine series 

nirx 

u{Xji) » S6n(0 sin-* 0 < x < 1. (7-5) 

I 


A sine series is chosen rather than a cosine series because such a series 
automatically satisfies the boundary conditions (7-4). To satisfy the 
initial conditions we require 

nvx , nicx 

fix) - S6„(0) sin —. gix) = XbUO) sin —• (7-6) 

These relations show that i)„(0) and 6^(0) must be the Fourier coefficients 
of fix) and gix). That is, if 

2 nrx 2 fi nrx 

fn^-: fi^) sin — dx, 9n ^ I qM sin —- dx (7-7) 

I Jo I I Jo I 

then multiplying (7-6) by sin nirx/l and integrating from 0 to I yield 

bniO) = /n, 6;(0) - gn. (7-8) 


We must still satisfy the differential equation. Upon substituting the 
terms (7-5) into (7-2) we get 

. 2 


nrx ^ /nrY 
S6n sin —^—h a^Ibn yyj 


sin ■ 


nrx 

T 


— 2bSbn sin - 


nrx 

T* 


which gives a set of ordinary differential equations 

,, , / nrd\J 

bn -f 2bbn + ^ 


(7-9) 


when the coefficient of sin nrx/l is equated to zero. 

Equation (7-9) may be solved as in Chap. 1 by assuming that bn == 

It is found that 

bnit) = cos QJnt + siii UjJ, (7-10) 


where 



(7-11) 


The arbitrary constants Co and Ci are determined from (7-8) as 

Co « /n, Cl = iQn + 5/n)a?~\ 

Substituting (7-10) into (7-5) yields the final answer 


[fn 


uix,t) ^ e fn cos 0>nt + (fifn + &/«) 


sin o)nt 


Wn 


nrx 
sin- 

J I 
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Conditions (7-2)**(7-4) are satisfied if the term-by-term differentiation is iegiUmate, 
for instance, if f'\x) and g'*{x) are bounded.^ When «« 0, the solution ag^rees with 
the sum of (6-9) and (6-10), as it should. According to (7-11), the damping reduces the 
frequency of the corresponding terms in the series for undamped vibrations. If 6 < ro/i, 
aU the terms are oscillatory and they have the same damping factor But for larger 
values of b the first few terms may have <*>« pure imaginary. The corresponding trigo¬ 
nometric functions become hyperbolic functions, and the terms in question are not 
oscillatory. If wn « 0, which may happen in this latter case, we replace (sin wnO/wn by 
its limit t [cf. Chap. 1, Sec. 32}. 


PROBLEMS 

1 . A string of length I vibrating in air satisfies the initial conditions tt(x,0) 
» fi sin xx/lf utixfl) « 0. Show that the displacement of the mid-point can be written 
in the form 

cos (w< -f- <^), Af 0 ), (ft const. 

2, Referring to Prob. 1, sketch the curves y « d=Ae~*** and y * w(^Z,0 in a single 
neat diagram. I’hus describe an experimental procedure for determining b. (When 
the oscillations are rapid and h is small, one can speak of the mean amplitude at a given 
time t. If the amplitude is Ao at time to and Ai at time io + r, the reader can verify 
that 



Since Ao and Ai can be found by placing a scale behind the oscillating string, this gives 
a method for comparing the viscosity of gases.) 

8. Forced Oscillations and Resonance. Sometimes the force function 
F(x,0 does not involve the unknown displacement u, as in (7-1), but is 
determined independently. (For example, consider the gravitational force 
on a horizontal vibrating string.) The corresponding mathematical 
problem is 

Utt - a^Uxx = F{r,t), u(x,0) - /(x), u,(x,0) = g(x), 

u(0,t) = 0, u(l,t) = 0. (8-1) 

Associated with this problem are two simpler problems, 

Vtt - a\x == F(x,0, r(a:,0) = 0, = 0, 



vm = 0, 

v{l,t) = 0, 

(8-2) 

and 

Wtt - a^Wxx - 0, 

w{x,0) = /(x), Wtixfi) = g{x), 



3 

II 

o 

w{l,i) — 0. 

(8-3) 


Equation (8-2) describes purely forced vibrations, and (8-3) describes free 
vibrations. Now, if v satisfies (8-2) and w satisfies (8-3), it is easily seen 

' See Chap. 2, Sec. 26, Theorem III. 
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that tt “ p + w satisfies (8-1). Also, uniqueness in the latter problem 
}delds uniqueness in the former. Since (8-3) was solved in Sec. 6, we need 
consider (8-2) only. This system will now be solved formally on the 
assumption that Fix,t) has a Fourier series, 


F(x,/) = SB„(<)8in-—• (8-4) 

t 

The coefficients are given by the Euler-Fourier formulas, 

2 ri rnr^ 

Bn{t) - y I sin -j (8-5) 

Substituting (8-4) and the Fourier series 

mrx 

u(x,t) *= S5n(0 sin “y (8-6) 

into the differential equation (8-1) gives 

„ nvx - nirx mrx 

Zhn sin-h ^o)lbn sin-~ Sj^n sin-» 

I Jr I 

where ojn nica/l [compare (7-11)]. If we equate the coefficients of 
sin nirx/lf we get 

K + Jffin = B„. (8-7) 

These equations are to be solved subject to the initial conditions 

bn{0) - 0, 6;(0) = 0, (8-8) 

which result from the initial conditions in (8-2). By the method of Chap. 
1, Sec. 28 [cf. also Eq. (33-9) in Chap. 1], the solution of (8-7) and (8-8) is 

bnii) “ ^ F„{\j sin W„(< - X) d\. (8-9) 

Determining B„(\) by (8-5), b„(t) by (8-9), and u(x,0 by (8-6) yields an 
explicit formula 

^ 2 mrx ft fi nir^ nira 

u{x,t) =“ S-sin ■— / Bin — sin ~~ (^ - X)F(t,X) d( dX 

mra I Jo Jo I I 

when ojn is replaced by its value mra/L 

If we have both damping and forcing, then (8-7) contains an extra term 2W>n as in 
(7*0. This leads to a different formula (8-9), but in other respects the analysis is un- 
^ changed. Thus, the method of Fourier series enables us to find the damped oscillationB 
of a string with arbitrary initial conditions and force function. 
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If F{Xjt) is periodic in there may be resonance, and this important 
phenomenon will now be discussed for the soecial case (cf. Chap. 1, Sec. 33) 

F(Xit) =s a{x) sin wt + b{x) cos o)L (8-10) 

[In tlie general case F(x,0 is a sum of terms like (8-10), since the assumed 
periodicity enables us to express F(x,t) as a Fourier series in L] With 
F(x,f) as in (8-10) the form of Bn{t) can be determined by inspection of 
(8-5). Substitution into (8-7) then gives an equation of form 

5'^ -f- o)lbn = aeinoyt + 0 cos wt, (8-11) 

where a and ^ are constant. 

If 03^ 9 ^ 0)1, the solutions of (8-11) are all bounded, but if w « a)„, the 
particular integral involves the functions 

t sin o)t, t cos o)t 

which increase indefinitely with t. Hence in that case the term 

nwx 

bn(t) sin —- (8-12) 

in the Fourier series for u(x,t) becomes strongly emphasized as i increases, 
and we say, briefly, that the oscillation (8-12) is resonant. 

A physical explanation is readily given in terms of the results of Sec. 5. 
Thus, the condition w = Wn can be written as 

27r ^ 21 
0 ) na 

This asserts that the period of F(x,t) in (8-10) is equal to the period for 
free oscillations of a string of length I/?:. And l/n is precisely the distance 
between nodes for a vibration of the type (8-12). 

Example: A cord stretched between the fixed points j; »» 0 and x Hs initially sup¬ 
ported so that it forms a horizontal straight line. Discuss the oscillations when the 
support is suddenly removed. 

The force function Fj(x,0 in Sec. 2 is — gp, and hence the partial differential equation 
is 

Utt ** g» (8-13) 

while the boundary and initial conditions are 

w(0,() » w(/,<) *» 0, 

u(x,0) - wi(x,0) * 0, (8-14) 

If wo succeed in finding a particular solution u « v{x) of (8-13) which satisfies the 
boundary conditions 


v(0) « v(l) « 0 , 


(8-15) 
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u(xfi « w(z,t) -f Ka?) ( 8 * 16 ) 

where, a« follows from (8-13) and (8-14), w{Xft) satisfies 

wtt - ahvsx * 0, w(Q,t) » 0, w(lfi •- 0, w(xfi) « -“»(«), 

wtixfi) » 0. (8-17) 

Since the desired particular solution v(x) is to be independent of t, the choice u(x,t) ** v{x) 
in (8-13) yields a*t»" * g, so that 

t,(a:) * - x), (8-18) 


when the integration constants are determined so as to satisfy (8-15). This particular 
solution corresponds to the equilibrium position of the string under gravity. The solu¬ 
tion of the system (8-17) can now be written down with the aid of (6-7) as 

iy{a;,0 « Hfoix - of) -h J4/o(x + of), 
where/e(q:) is odd, has period 21, and is defined for 0 < a: < 1 by 

Mx) -Kic) * ^ ^ 

The required solution is u « v H- ta?. 

By interpreting /o(-r), — at), and fo(x -f at) graphically one finds that uix,t) is 

largest on 0 < a: < i when at ^ 0, 21, 41, and then w*(x,0 » /o(a:). Similarly, w{x,t) 
is least when at I, Zl, 51, .. and then w{x,0 * --/o(x). It follows that the cord oscillates 
between the horizontal position u « 0 and the position u »= 2v(x) in which each point 
is twice as low as the equilibrium position (8-18). The period is 2l/a. 


PROBLEMS 


L A horizontal cable 100 ft long sags 5 ft when at rest under gravity. If the cable 
is disturlxid so that it oscillates without nodes, what is the frequency of the oscillations? 
mnt: See (8-18). 

2 . A string of length / is subjected to a force F(x,i) «» sin U sin vx/l, where u) is 
constant. Find the displacement u(x,t) if the string was initially at rest in the equilib¬ 
rium position. Be sure to distinguish the cases w nira/l and a> «« nira/l. 

3. Show that the equilibrium shape of a string under a force F(x) is described by 

“-siT 

4. Show that the function 


v(x,t) 





ds dr 


satisfies vtt — a\j: ■« F(x,t). Hint: Let x d* cU « r, a: — af » «, v{x,i) « F(r,«). 
AS in Sec, 1, —ia^Vt, » F{x,t). 

t. If v{xfi is the function obtained in Prob. 4, let w(x,t) be determined by 
Wit — 0, w(x,Q) » f(x) — s(®,0), ii;t(x,0) g(x) — Vt{xfi), 


Then, 
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(of. Sec, 3)* Then w -■ w -f satisfies 

Uet - a^Uxx *» u(x,0) » f(x), w<(x,0) » ff(z), 

6 . Solve the Example in the text by means of Fourier series. 


SOLUTION BY SERIES 

9. Heat Flow in One Dimension. The foregoing discussion of the 
vibrating string enabled us to survey the field of partial differential equa¬ 
tions and to illustrate a number of important methods. Prominent among 
these is the method of infinite series, whicli will now be explored more 
fully and used in a variety of applications. We begin with a problem from 
the theory of heat conduction. 

Consider a section cut from an insulated, uniform bar by two parallel 
planes Ax units apart (Fig. 11), and suppose that the temperature of one 


Temperature 


U'i’Au 


x“0 


X x+Ax 


x«l 


Fia. 11 


of the planes is u while tliat of the second plane is u + Aw. It is known 
from experiment that heat flows from the plane at higher temperature io 
that at the lower, the amount of heat flowing per unit area per second 
being approximately 

Aw 

Rate of flow ~ fc- (9-1) 

Ax 

Here A: is a constant called the thermal condivciiinty of the material; its 
dimensions in the cgs syst(im are cal/(cm-sec °C). In the limit as Ax —0, 
Eq. (9-1) can be regarded as an exact equality, so that 

Rate of flow = — ku^. (9-2) 

On tlie other hand, if c is the heat capacity of the medium and p its 
density, the amount of heat in the section from x to x 4- Ax is 

{cpA Ax)w, (9-3) 

where A is the cross-sectional area and vFere w is the mean value of u 
over the interval (x, x 4- Ax). For a time interval (/, t + A^ the increase 
in amount of heat in the section (x, x -f- Ax) can be computed from (9-3) 
and also from (9-2). The computation yields ^ 

* It is supposed that no heat is generated within the material and that fc, p, and c 
are constant over the relevant range of temperatures. If p is measured in grams per 
cubic centimeter, the dimensions of c are cal/(g “C). 
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CfiA Ax il(Xf t + At) cpA Az U{xfy 

IcA At Uxi,x AXf t) JcA At 'iix(.^)t)i 

where w, is the mean value of w* in the time interval (i, t + At), Dividing 
by CpA Ax At we obtain 


U( Xft + AO — ll(Xft) k Ux(x + Ax, 0 — Ux{Xjt) 


At cp 

and letting Ax 0, Ai —» 0 now gives 

Uf sss Qr Uxxt 


Ax 


.2 == 


cp' 


(9-4) 


(9-5) 


if we recall the definition of partial derivative. 


The fact that (9-4) involves mean values causes no trouble when ut and Usx are con¬ 
tinuous; see the discussion of Eq. (7-4), Chap. 5. Thus, (9-5) follows ivtlhoiU apprortma- 
tion from appropriate physical assumptions. This contrasts to the‘wave equation 
uti “ which is only an approximate statement of Newton’s law for the vibrating 

string. 


We shall now solve (9-5) under the assumption that the initial tem¬ 
perature is a prescribed function /(x), 

w(x,0) ~ /(x), 0 < X < i, (9-6) 

which can be represented by a convergent Fourier series. The ends of 
the bar are assumed to have the temperature zero: 

u{0,t) = =0, t>0, (9-7) 

Since u^x must exist if u satisfies (9-5), we know that w(x,0 has a Fourier 
series in x for each fixed f > 0: 

mrx 

u{x,t) == S6n(f) sin- (9-8) 

t 


Here, a sine series is chosen because such a series automatically satisfies 
the requirement (9-7). Proceeding tentatively, assume that (9-8) can be 
differentiated term by term to give 


, nwx ^ / 

261(0 sin — » a^26„(0 (y) 




l ) 


(9-9) 


upon substitution into (9-5). Equation (9-9) is satisfied if the coefficients 
of sin nrxH on each side are equated: 






2 

I 6 „. 
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Upon integration this gives 

Kit) « 

where the c„ are constant, and hence (9-8) becomes 

uix,t) = ' sin (9-10) 

L 

The initial condition (9-6) yields 

^ nvx 

fix) =» uixfi) « Seisin-—- (9-11) 

L 

Since the Fourier series for/(x) converges to fix) by hypothesis, Eq. (9-11) 
is assured if are the Fourier coefficients, 


Cfi 


2 

- / /(T)8in 

/ Jo 


UTX 

- dx. 

I 


The only questionable step in the foregoing discussion was the tenn-by-term dif¬ 
ferentiation, but this step can now be justified. Differentiating (9-10) term by terra 
actually does give 


Uxz 


( nit 

T 


I 




(9-r 


because the series (9-12) are uniformly convergent when t > b > 0. (See Chap. 2, Sec. 7, 
Theorem IV. The uniform convergence follows from the convergence of 

V,j2g -ianrlD^b^ 

since the Fourier coefficients r„ are bounded.) Hf3nce, (9-10) is a solution of the problem. 
We cannot yet say that (9-10) is the solution, becau.se tht‘re might be another solution— 
necessarily different from the one w^e found - for w hich the k^rm-by-berm differentiation 
IS not permissible. A uniqueness theorem is established, howfiver, in Sec. 24. 

Because of the exponential factors the series (9-10) is rapidly con¬ 
vergent and affords a useful means of computing the temperature. By 
contrast, the series obtained in Sec. 0 for solutions of the wave equation 
converges no better than the series for the initial values fix) and gix). 
The physical significance of this difference in the tw^o cases is discussed in 
Sec. 27. 

Example 1. Find the steady-state temperature of a uniform bar. 

It is required that u(x,0 be independent of <, whence by (9-5) 

a^Uxx * u< *» 0. 

Hence, u co -h cix, where co and ci are constant. If the temperatures at the ends 
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are, respectively, uq and wi, we can determine the constants and thus obtain the formula 

w(ar,0 uo -f j (wi — i^o). (9-13) 

The rate of heat flow is given by (9-2) and (9-13) as 


and hence, (9-1) holds without approximation in the steady state. 

Example 2. A rod of length 5 has the end x *= 0 at 0°, the end « 5 at 10°, and the 
initial temp)erature is/(.T). Find the temfK'rature distnl>uti(»n. 

If v(Xjt) is the unknown temperature at point x and time t, we let 

u ^ V — 2x, (9-15) 

where 2x is the steady-state L^mperature determined from (9-13). Then a^Uj^x = 
Ut, u(xfi) » fix) — 2t, w(0,0 u(5,0 — 0. Henee w is given by (9-10), wluTe the CnS 

are the Fourier coefficients of fix) — 2x. When we have found u, Eq. (9-15) gives v 
We have noted that the value 2x introduc<‘d in (9-15) is the 8t(*ady-state teuifierature 
as determined by Example 1 The same method enables us to replace anv constant 
boundary conditions by tlie honu^geneous conditions u(0,/) = w(/,0 * 0 That is, if (he 
unknown temperature v(x,t) satisfies 

r(0,0 « ro, vil,t) «* vi, ro and const, (9-10) 

we doflno u to be the difference between v and the steady-state teinj>erature. 

uix,t) « vix,t) - -h I («i J • 

Then u(0,0 uil,t) ** 0, and henee u can be determined by the method of the text. 
A similar use of the steady-xState solution was made in the Flxample, Sec. 8. 


PROBLEMS 

1. Compute the loss of heat per day per sijuari* meter of a large concrete wall whose 
thickness is 25 cm if one face is kept at 0°(' and the other at 30°(\ Use k = 0.(X)2, 
and axssume steady-state conditions. Hint. The wall can be tliought to be composed 
of bars 25 cm long perpendicular to the wall faces. By symmetry, no heat flows through 
the sides of these bars m the steady state, and hence (9-14) can be applied. 

2. An insulated metal rod 1 m long has its ends kept at 0°C and its initial temiierature 
is 50°C. What is the temperature in the middle of the rod at any subsequent time? 
Use k » 1.02, c « 0.06, and p « 9.6, 

3. I..et the rod of Prob. 2 have one of its ends kept at 0°C and the other at 10°C. 
If the initial temperature of the rod is 50°C, find the temperature of the rod at any 
later time. Hint: See Example 2. 

4. An insulated bar with unit cross-sectional area has its ends kept at temperature 

0, and the initial temperature is fix) « Cn sm nrx/l, where Cn is constant and n is an 
integer, (a) Show that the amount of heat present in the bar initially is 2lcpcjnv if 
n 28 odd and 0 if n is even. (5) Show that the net rate of flow out of the bar across the 
ends is when n is odd and 0 when n is even. Htrd: The rate of 
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flow out of the bar at the end a; « 0 is -f few*, not --kux. (c) How much heat flows out 
of the bar in the time from < *» 0 to f? Evaluate as ^ compare (a), and explain. 

6. By addition of the results in Prob. 4 obtain similar results for the bar with arbitrary 
initial temperature/(i). 

10. Other Boundary Conditions. Separation of Variables. In the fore¬ 
going section the differential equation 

Ut ^ Oi Uxx ( 10 - 1 ) 

was obtained for the temperature u{x^t) of an insulated bar at point x 
and time t. The initial condition was 

u{xfi) « fix), 0 <x <1, (10-2) 

and the ends were held at constant temperature. 

If, instead, the ends are insvtated, the boundary conditions are 

vAOf) == 0, uAlf) = 0. (10-3) 

Equations (10-3) are appropriate because by (9«2) they state that the 
rate of flow across the ends is zero. We shall now consider the problem 
posed by (10-1)-(10-3). 

The boundary conditions (10-3) are satisfied automatically if we express 
uixjt) as a cosine series: 

] mrx 

uixyt) = -aoit) + Sa„(0 cos—* (10-4) 

ji I 

Thus, Ux in (10-4) is a sine series (assuming that one can differentiate 
term by term), and we have already noted that the sine series vanishes 
at a:- == 0 and 1. 

Substituting (10-4) into the differential equation (10-1) gives 

) fln, (10-5) 

just as in the derivation of (9-9). Solving (10-5) and substituting into 

(10-4), we find 

1 , mrx 

w(x,f) = - Co + ^ (10-6) 

2 / 

where the CnS are constant. The initial condition (10-2) shows that the CnH 
are the Fourier cosine coefficients, 

2 fi mrx 

c„ = j jj, /(^) cos — dx, (10-7) 

and the problem is solved.* 

‘ The solution can be verified, if desired, as m the previous section. 


^ ' n ' 2 /”’' 

2 ^^0 ^ \ / 
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We shall now solve this same problem by an important method known 
as aeparution of variahleB. It will prove interesting to compare the various 
stages of the solution with the answer, (10-6). 

The desired solution (10-6) is a sum of terms each of which has the form 

X{x)Tit), (10-8) 

In the method of separating variables the idea is to construct functions 
of the form (10-8) which satisfy the differential equation and the boundary 
conditions. By superposition of these functions (10-8), one then satisfies 
the initial conditions. The fact that there is a solution of the type (10-6) 
gives good reason for expecting the method to succeed. 

Substituting (10-8) into (10-1) yields 

xr = a^r'T, 

where the prime denotes differentiation with respect to the appropriate 
variable. Dividing by XT we get 


r 


T 



(10-9) 


The variables x and t in (10-9) are separated, in that the left side is a function 
of t alone and the right side is a function of x alone. It follows that each 
side must be constant, independent of both x and t. A brief investigation 
of the effect of changing sign in (10-10) shows that XT can satisfy (10-3) 
only if the constant is zero or a negative number—Thus, 

T^ X'^ 

— « = -p2. (10-10) 

Independent solutions of (10-10) are ^ 

T =s X = cos-x, X == sin-x. (10-11) 

a a 


The boundary condition Wa:(0,0 = Ofor?^ = requires that X'(0) = 0, 
and hence the appropriate choice of X in (10-11) is 


A - cos-x. 


( 10 - 12 ) 


Similarly, the condition Ux{l,t) « 0 gives X'(^) = 0, so that 

nira 


P 


I 


(10-13) 


^It ifl suggested that the reader compare XT at this and subsequent stages with the 
general term of (10-6). 
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where n is sb integer. By (10-11)'-(10«13) we see that the function 

T{t)X{x) « cos ^ (10-14) 

satisfies the differential equation and the boundary conditions. To satisfy 
the initial conditions we form a superposition of terms (10-14). The 
resulting series is precisely the series (10-6), and the solution is completed 
as before. 

The merit of the separation method is that it produced the functions 
cos {nrx/l) by direct consideration of the differential equation. If some 
other functions had been more appropriate, the method would have pro¬ 
duced those other functions instead. This fact will now be illustrated by 
an example. 

According to Newton’s law of cooling, a body radiates heat at a rate 
proportional to the difference between the temperature n of the radiating 
body and the temperature ?^o of the surrounding medium. Thus, if our 
insulated rod of length 1 has the end j = 0 maintained at temperature 0 
whiUi the other ('iid radiates into a medium of temperature uq — 0, the 
corresponding boundary conditions are 

a(0,0 = 0, Wx(/,0 ~ —hu{l,i), (10-15) 

where h is constant. [The second condition (10-15) states that the rate 
of flow --his is proportional to u{l,() — 0, and this agrees with Newton’s 
law.] If h — 0, there is no radiation and we have the condition for an 
insulated end as discussed prc\iou.sly. But if h > 0, which we now assume, 
the problem is essentially different from those' considered hitherto. The 
difference results from the fact that (10-15) cannot be satisfied in any 
simple way by an ordinary Fourier scries. 

Actually, as we show next, the appropriate functions for the problem 
(10-1), (10-2), and (10-15) are not sin (mrx/J) or cos {nirx/I) hut arc sin 
where the i3„s are the positive roots of the transcendental etjuation ' 

cos = —h sin 01. (10-16) 

Although one could hardly expect to discover the sequence sin 0n^ by 
a priori considerations, it is produced automatically by the method of 
separating variables. The solution to the problem is found to be 

u{x,t) sin(10-17) 

' Since the equation is equivalent to tan/3i « -0/h when h 5 ^ 0 , its roots can be 
obtained grapliically by considering the intersection of the curves j/ *« tan and y » 
-0/h. Cf. Example 2, Sec. 2 , Chap 9. 
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where Cn is given in terms of the initial values by 

f f{z) sin dx 

Cn * - (10-18) 

Jo 

To obtain this solution by separating variables, observe that the substitution 
u ** X(x)7\t) leads to functions of the type (10-11), exactly as in the former case. Here, 
however, the condition w(0,0 «* 0 gives X(0) « 0, so that we require the sine rather 
than the cosine. The resulting expression 

T(l)X(x) - sin - x 

a 

becomes (10-19) 

if we set p « a/9, and this form will be more convenient for our purposes. The function 
(10-19) satisfies (10-1) and the first boundary condition (10-15) for all values of the 
constant /8. To satisfy the second condition (10-15) we must choose j9 so that 

cos sin 01, (10-20) 

and tliis leads to (10-16). The resulting functions 

sin 0nX 

satisfy both boundary conditions (10-15) and also satisfy the differential equation 
(10-1). If a suitable superposition (10-17) is found to satisfy the initial condition, our 
problem will be solved 
Setting f * 0 in (10-17) gives 

f(x) « Xcn sin 0nX. (10-21) 

As in Chap, 2, Sec. 22, Example 1, we can show that the functions sin 0n^ are orthogonal 
on (0,0, and hence the CnS are given by (10-18). llie solution can be venfied by the 
method of Sec. 9 if f(x) admits an expansion (10-21). Since an analogue of Dirichlet’s 
theorem holds for the sequence am 0nX, Eq. (10-21) is not a serious restriction on f(x). 


PROBLEMS 

1. If /(x) « g{t), where x and t are independent variables, show that f{x) and git) 
are constant. Hint' Let < ™ to, a fixed value. 

2. Attempt to satisfy the conditions (10-3) by choosing a positive constant -f-p^ 
instead of — p^ in (10-10). 

8. By using the functions (10-11) solve 

ut « a^Uxx, w(0,0 »= uil,t) as 0, 'a(x,0) « fix). 

4. (a) Describe a physical situation which would lead to 

Ut *» a^Uxxf ^^(0,0 Uxilfi) “ 0, m(x,0) «« fix). 

(h) Solve by separating variables [cf. (10-11)]. (c) Verify that your result agrees with 
(10-16)-(10-18) for A « 0. 

6. Solve Prob. 4 by the method of images. 
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Outline of the Solution. Consider a rod of length Zl with ends at temperature 0. Let 
the initial temperature /o(x) agree with f{x) on (0,f), and let /o(x) be symmetric about 
X « I (Fig. 12). By symmetry, no heat flows across the center, and hence the left 




half of the long rod behaves like the rod of Prob. 4, The temperature for the long 
rod can be found from (9-10), 

6. The vertical displacomenl w(x,0 of a vibrating string with fixed end points satisfies 

Utt « w(0,0 a(/,f) * Ov 

By s()tting u{x,i) « X(x)T(t) and separating variables, obtain solutions of the form 

mrat nirx nvat mrx 

sin —sin -y- and cos — j— sin —y- • 

7. In Prob. 6, express u{x,t) as an infinite series if 

u(xfl) «= fix), ntixfi) » 0. 

8 . In Prob. 6, express u(x,l) as an infinite series if 

ti(j,0) « 0, =» g{x). 

11. Heat Flow in a Solid. By a procedure similar to that of Sec. 9 one 
can establish the equation 

k 

It, = -f Uyu + M/j), ^ —> (11-1) 

cp 

for the temperature ^ // = u{x,y,Zyi) in a uniform solid at time t. This is 
the three-dimensional form of the eiiuation 

Ut = a^u^rx ( 11 - 2 ) 

obtained previously for heat conduction in a rod. The state of the solid 
at time t = 0 gives the initial condition; the state of the surface for / > 0 
gives the boundary condition. P'or instance, if the surface radiates accord¬ 
ing to Newton’s law, the boundary condition is 

du 

-Jc — = eiu - Mo), (11-3) 

dn 

where mq is the temperature of the surrounding medium, e the emismity, 

' See the derivation in Chap. 5, Sec. 16. A similar equation governs diffusion and the 
drying of porous solids, with u equal to tlie concentration of the diffusing substance. 
Because of this analogy many problems on diffusion and heat conduction are mathe¬ 
matically indistinguishable. The constant a® in (13-1) is often called the diffustvity. 
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aad du/dn the derivative in the direction of the outward normal. When 
c ate 0, E<}. (11-3) means that the body is insulated. 

Sometimes there is so much symmetry that 
u in (11-1) does not depend on y or z. In this 
case (11-1) is the same as (11-2), since the 
terms and in (11-1) are zero, and the 
analysis of Secs. 9-10 can be applied without 
change. 

As a specific illustration consider a uniform plate extend¬ 
ing from the plane x « 0 to the plane x «» d (Hg. 13). 
Let u « uo on the surface x 0 and w *» ui on the surface 
X ** d, where uo and uj arc constant. If the plate is infinite, 
or if the edges are fai away from the points being con¬ 
sidered, the symmetry suggests that u depends on x only 
and, hence, that (11-2) holds. The steady-state t43mi.)er- 
ature is then given by Example 1, Sec. 9, as 

X 

w « ito + j (ui - no). 
a 

Since the rate of flow is —kuz, the amount of heat Q 
flowing across the area A in t sec is 



htA 


Uo — Ui 


If the flow of heat is steady, so that w is independent of time, then ut 
and (11-1) reduces to 


Uxx + Uyy + Uzz = 0. (ll“4) 

This is known as Laplace's equation; it occurs in a 
variety of physical problems. The corresponding 
two-dimensional form is 

Uxx + Uyy — Of U = u(Xfy), (11-5) 

To illustrate the use of (11-5) we shall discuss 
the steady-state temperature in an infinitely long 
metal strip of width d (see Fig, 14). If the sides 
of the strip have the temperature ziero and the 
bottom edge has the temperature/(a;), the boundary 
conditions are 



Fia. 14 


uiO^y) * 0, u{dfy) «0, w(x,0) «/(x). (11-6) 

We assume besides that (11-5) holds for 0 < x < d, |/ > 0. 

It is a surprising, fact that these conditions do not suffice to determine 
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the temperature.^ However, one expects the temperature to approach zero 
as one moves away from the bottom edge, so that 

lim u(x,y) =* 0 uniformly in x, (11-7) 

•Hl> «> 

If this condition is explicitly required, the solution can be shown to be 
unique (see Sec. 24). 

Although the problem can be solved very simply by Fourier series, we 
prefer to show how the desired functions are generated by the method of 
separating variables. The choice u = X(x)Y{y) in (11-5) gives 





( 11 - 8 ) 


after dividing by XF. Since the variables in (11-8) are separated, each 
side is a constant. The boundary conditions applied to XF show (after 
some calculation) that the constant must be a negative number —and 
hence (11-8) gives 



Since (—p)^ = p^, we can assume that p > 0 with no loss of generality. 
Linearly independent solutions of these equations are, respectively, 

cos px, sin px and 

Since u{0,y) = 0 requires that X(0) = 0, we reject the cosine, and in 
view of (11-7) we reject the solution Hence the function XF takes the 
form 

XF = c"“^^&inpx. (11-9) 

The boundary condition u{d,y) — 0 gives p = riw/d, where n is an 
integer. Forming a linear combination of the resulting solutions (11-9) 
we get 

u{x,y) = sin- (11-10) 

d 

and the condition u{xfl) == fix) now shows that the c,i8 are the Fourier 
coefficients 

2 fd mrx 

Cn = - / /(x) sin —— dx. 
dJo d 

The solution can be verified, if desired, as in Sec. 9. 

‘ The trouble is that the other end of the strip must be taken into account even though 
it is infinitely far away. This purpose is served by (U-7). 
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The foregoing derivation obscures an important point which will now be discussed 
more fully. Although the solutions 

and 

can be chosen for the equation Y" = p^F, these are not the only possibilities. Another 
pair of independent solutions, for example, is 

cosh py and sinh py. 

If, now, we try to decide which of these functions satisfies (11-7) it will be found that 
neither one does. 

What is really involved is the following: The general solution of Y" « p^F is 

F « aePv 4- 

where a and 6 are constant. By (11-7) we get a = 0, and hence F »» 6cThe reader 
can verify that if 

F » oo cosh py -f 6o sinh pt/, 

the condition (U-7) ^dll give ao -f 6o « 0, and again F is a multiple of Similar 

remarks apply to the construction of A'(x) and to the derivation of (10-14). 

Just as in the case of the rod, this problem involving a strip can be given 
a three-dimensional interpretation. That is, the strip netul not \ye thin 
provided there is no variation of temperature atToss its thickness. By 
letting the thickness approach infinity, we get a semi-infinite plate. (Jn 
Pig. 14 the plate extends infinitely far toward and away from the reader; 
the area outlined in the figure is the cross section of the plate, not a frontal 
view\) The boundary-value problem for the plate is 

Uxx + ^vv + = 0, 0 < X < d,XJ > 0, —00 < 2 < 00, (ll-H) 

u{Q,y,z) = 0, u{d,y,z) = 0, u{x,0,z) =/(x), (11-12) 

lim u{Xyy,z) = 0 uniformly in x and z. (11-13) 

y —> Bp 

If we assume u independent of z, the resulting problem is the same as that 
formerly considered, hence has the uni(pie solution (11-10). 

The fact that u(x^y,z) is independent of z does not follow from the physical symmetry 
but requires the condition (11-13). Indeed tlie function 

. 'irx , rry ^ 
u Sin Sin -~e ^ 
a d 

satisfies (11-11) and (11-12) with /(x) *= 0 and yet depends on z. Reduction of the 
dimension by omitting a variable is really an application of uniqueness. If we verify 
that (11-10) satisfies the problem (11-11) to (11-13), and that the problem has no other 
tolutiouj then it is true that u must be independent of z. 


PROBLEMS 

X* A refrigerator door is 10 cm thick and has the outside dimensions 60 by 100 cm. 
If the temperature iaside the refrigerator is — 10®C and outside is 20°C, and if A 0.0002, 
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find the gain of heat per day across the door by assuming the flow of heat to be of the 
same nature as that across an infinite plate. 

2. If f(x) *» 1 and d «* tt, show tlmt (11-10) gives 

u«~fe~*' 8 inj-|-i sin 3 x *f 7 e~^ sin fir -i-V 

IT \ 3 0 / 


6. A semi-infinite plate 10 cm in thickness has 
its faces kept at 0®C and its base kept at lOO'^C'. 

What is ilie steady-state temperature at any point 
of the plate? 

6. I'he faces of an infinite slab 10 cm thick are 
kept at temperature 0®C. If the initial tempera¬ 
ture of the slab is 100®C, wliat is the state of the 
t<jmperature at any subsequent time? 

7. A large rectangular iron plate (Fig. 15) is 

heated throughout to 100®C and is placed in con¬ 
tact with and between two like plates each at 0®C. Fig. 15 

The outer faf*es of these outside plates are main¬ 
tained at 0®C. Find the temperature of the inner faces of the two plates and the 
temperature at tlie mid-point of the inner plate 10 sec after the plates have been put 
together. Given; a =< 0.2 cgs unit. Hint The boundary and initial conditions are 

u(0,<) « 0, a(3,0 - 0, w(u,0) - /(j), 

where/(j) * 0 for 0 < x < 1 and 2 < x < 3 but /(x) « 100 for I < x <2. 

12. The Dirichlet Problem. The Ijaplacc equation 

"4" ttzz ~ 0 (12-1) 

was obtained in Sec. 11 for steady-state heat flow. We shall show how the 
same equation arises in electrostatics and gravitation.^ 

It is a consequence of Coulomb's law that the potential due to a point 
charge q at (:r 1 , 2 / 1 , 21 ) is 

q 

u - - taking u = 0 at r = oo, (12-2) 

r 

where r is the distance from the charge to the point (t, 2/,2) at which u is 
computed. Thus, 

r2 = (x - + {y- yx? + (z - r > 0. (12^) 

* A more complete discussion is given in Chap. 5, Sec. 14. The relation of Laplace^s 
equation and fluid flow is developed in Chap. 6, ^cs. 15 and 17, and in Chap. 7, Sec. 19, 
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The potential due to a distribution of n point charges g,- is given by addition, 

^ ^ (12-4) 

n 

and the potential due to a distribution of continuous charge of density p 
in a body r can l>e obtained from an expression like (12-4) by passing to 
the limit. 

It is easily shown that 1/r satisfies Laplacc^s equation (12-1), and hence 
the same is true of u in (12-4) provided no r, is zero. This latter condition 
means that there is no charge at the point of observation. One would 
expect, therefore, that the potential due to a continuous charge distribu¬ 
tion will also satisfy (12-1) if there is no charge at the point of observation. 
Tills is actually the case, and that is the reason why Laplace^s equation 
plays such a prominent role in electrostatics. Although a more sophisti¬ 
cated treatment may be given, it all comes down to tlie same thing; namely, 
1/r satisfies (12-1), and the potential is given by some sort of superposition 
process applied to 1/r. 

Since the gravitational potential satisfies (12-2) (where q is the mass of 
the attracting mass point), the study of gravitation also leads to LaI)lace^s 
equation. In view of its many applications, the Laplace equation (12-1) 
is profitably regarded as a field of study in its owm right. Such a study 
leads the W'ay to a branch of analysis known as poterUial theory. 

An important problem in potential theory is the DirichM problem^ 
which can be stated as follows: Suppose given a body r in {x^y^z) space, 
together with assigned values/(x, 2 /,z) on the surface of r. Find a function 
u which satisfies Laplace^s equation in r and is equal to f(x^y,z) on the 
surface. The foregoing discussion gives a number of physical interpreta¬ 
tions. For instance, if u is temperature, the Dirichlet problem is to find 
the steady-state temperature in a uniform solid when the temperature 
on the surface is given. But if v is the electrostatic potential, the problem 
is to find the potential inside a closed surface when the potential on the 
surface is known. Interpretations in terms of diffusion, fluid flow, and 
gravitation can also be given. 

Since solutions of Laplace's equation are often called harmonic functions^ 
Dirichlet’s problem can be stated as follows: Find a function which is 
harmonic in a given region and assumes preassigned values on the boundary. 
In two dimensions a harmonic function u{x^y) satisfies 

Uxx '^vy ** ( 1 - 2 - 6 ) 

The region in Dirichlet's problem is now a plane region, and its lioundary 
in a cuiwe. The physical interpretation refers to phenomena in a thin 
plane sheet, or it refers to three-dimensional phenomena which show no 
dependence on z. The latter condition is to be expected when there is 
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cylindrical symmetry, that is, when all planes z « const exliibit the same 
geometry and boundary conditions. 

We shall now solve the Dirichlet problem for a circle. It turns out that 
the problem is greatly simplified by use of polar coordinates appropriate 
to the circular symmetry. With 

X ^ r cos Of y ^ r sin 0, uix^y) js U{rfi)f 

an elementary calculation shows that (12-5) becomes 

{rUr)r + ~Ue8^0 (12«6) 

r 

(see Prob. 2). The boundary condition can be expressed as 

UiRfO) -/(^), (12-7) 

where/(i9) i.s a known function of B and R is the radius of the circle. 

For each value of r it is clear that U has period 27r in 6, since u is single¬ 
valued, and therefore U has a Fourier series 

UiTfO) --h S[an(r) cos + 6n(r) sin 7ud]. (12-8) 

2 

Proceeding tentatively, we substitute (12-8) into (12-6) to obtain 

^ + S[(ran)' cos 7vd + (rh^Y sin iiB] 

- X(ann^ cos fiB -f- sin nB) ~ 0. 

r 

Since the coefficients of cos nd and of sin nB must vanish, 

(ralfY = ~ n^anf n = 0, 1, 2, .. 
r 

(rby = - n%nf n = 1, 2, 3, .... 
r 

These equations are both of form 

riry'Y = n^y 

which is readily solved by the method of Chap. 1, Sec. 30. Specifically, 
the substitution y ^ gives 

r{ar^Y 

whence a « dbn. Since a«(r) and hn{r) must be finite at r « 0, the minus 
sign is excluded, and 


(? 


a,(r) - OnT', 


bnir) - 6„r» 
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where a* and 6* are constant. Hence by (12-8) 

O/f) 

XJ{rfi) «* -—h ^{anT^ cos vB + sin n^)* (12-9) 

2 

Putting r « iJ and using the boundary condition (12-7) give 

Un 

/(0) =-[- 2(ani?“ COS ?i$ + 6n/2” sin n3), (12-10) 

2 


If/(^) has a convergent Fourier series, the validity of (12-10) is ensured by 
choosing UnR^ and bnR"^ to be the Fourier coefficients of /: 

1 rff 

an/2” = - / /(0) cos mf) d<l>, 

( 12 - 11 ) 

1 rr 

6„/2” = - / /(</>) sin n<t> dip. 

TT •'—IT 


The problem is now solved, but a simpler form can be found as follows: 
Substituting (12-11) into (12-9) gives 


1 r /^V 

+ 2 y oosn(^-.^) 


/( 4 ) d<t>, ( 12 - 12 ) 


when we note that 


cos nd cos n<p sin nO sin mp = cos {nO — n<p) 


and interchange the order of summation and integration. The series in 
brackets in (12-12) can be summed as in Chap. 2, Sec. 17, Prob. 0. Tlie 
result is the Poisson formula for a circle ^ 




R^-r^ 


27r •'-» — 2rR cos {6 — <l>) + r 


;m d4. (12-13) 


If f(4>) is piecewise continuous and bounded, one can differentiate under the integral 
sign for f < 12 to find that (12-6) holds. Also, it can be shown that (12-13) gives 

lira U{r,e) '»f{e) (12-14) 

f —♦ jRi — 


provided / is continuous at B. Hence (12-13) is a solution. In view of the derivation, 
it is remarkable that (12-14) holds even when the Fourier series for/ does not converge 
to/. 

The expression (12-13) gives the steady-state temperature of a thin 
uniform insulated disk in terms of the temperature at the boundary. Or 

* Another derivation is given in Chap. 7, Sec, 21, 
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(12-13) can be interpreted as giving the temperature in a circular cylinder 
when the temperature of the surface is/(^) independent of z. On the other 
hand the formula also gives the electrostatic potential in terms of its 
values on the boundary, and so on. 

Example: I^et uix^y) be harmonic in a plane region, and let C be a circle contained 
entirely in the region. Show that the value of u at the center of Cis the average of the 
values on the circumference. 

Without loss of generality we can take the center to be at the origin. Equation (12-13) 
then gives, with r « 0, ^ 

tt(0,0) - 1/(0,9) m <4. (12-15) 

Since /(<^) stands for the values of u on the boimdary, this is the required result. 

PROBLEMS 

1. (a) Verify that 1/r in (12-2) satisfies the Laplace equation (12-1). Hint: rr, «• 

X — xi. Using this, find (l/r)**. (6) Verify that logr satisfies the two-dimensional 
I^aplace equation (12-5). 

2. If u(x,y) — U(r,B) with x » r cos y « r sin show that 

+ Mw “ r'^\rUr)r + 

Hint: Ur ^ Ux cosB -^Uy sin Ui ** Wx(—r sin B) -f Wy(r cos $), Similarly, compute 
(rUr)r and (t/a)#. 

8. Derive (12-10) by considering U « R(r)B($) and separating variables. 

4. Give two physical interpretations of the following Dirichlet problem for a semi¬ 
circle, where w(x,y) ■» U(r,0) as in (12-6): 

^ 0 , X* -f y® < 1, y > 0, 

11(1,B) « g(B), 0<e <ir, 

U(r,0) « U(r,ir) « 0, 0 < r < 1. 

5 . Solve Prob. 4 by the method of images. Hint: For 0 < B < %, define f(B) «• g{0), 
f(--B) — —giB) and use (12-13). 

6. Obtain a formula analogous to (12-13) for the region r > R. (Assume that | U(rfi) j 
is bounded as r —► and, hence, that positive values of n in the discussion of the 
text may be rejected.) 

7. Interpret the result of Prob. 6 physically in terms of an infinite metal plate with a 
hole whose edges have a prescribed temperature. 

13. Spherical Symmetiy. Legendre Fimctions. Let it be required to 
determine the steady-state temperature in a uniform solid sphere of radius 
unity when one half of the surface is kept at the constant temperature 0*^C 
and the other half at the constant temperature 1®C. By the discussion 
of Sec. 11, the temperature u within the sphere satisfies Laplace's equation 

+ Uyy + Uag 0. il3-f 

1 
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Symiaetiy suggests the use of spherical coordinates with origin 
at the center of the given unit sphere {Fig. 16). Since 

X » r sin 0 COB 0, 

y «= r sin ^ sin 

2 « r cos 6, 



Laplace’s equation can be shown to be' 

r(rJ7)rr + (Ue sin 0)$ esc 0 + csc^ ^ « 0, (13-2) 

where u{x,y^z) « U{rfi^4>), If the plane separating the unequally heated 
hemispheres is the xy plane, the symmetry suggests that U will be inde¬ 
pendent of so that (13-2) becomes 

T(rU)rr + (Ussm 0)$ CSC 0 = 0. (13-3) 

The boundary conditions are 

u « 1 for 0 < 0 < }/iv, when r = 1, 

(13-4) 

u = 0 for < 0 < IT, when r * 1. 

We shall use the method of separating variables. Substituting the form 

U « iB(r)e(0) 

into (13-3) gives two ordinary differential equations, 


r(rB)" - «« « 0, 
(0' sin 0)' CSC 0 + 00 » 0, 



(13-5) 


* See Chap. 5, Sec. IS, or proceed as in Frob. 2 of the preceding section. 
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vbsre a is an arbitrary conataat. The first of these equaticms can be Bolved 
by assuming that JR as in Chap. 1, Sec. 30. One obtains the linearly 
independent solutions 

where m satisfies the quadratic equation 

m(m + 1) “ a. (13-6) 

Changing the independent variable in (13-5) from 4 to a: by meaTia of 
X — cos d, 6{e) = P(x), 

and replacing a by the expression (13-6), we get Legendre’s equation 

(1 - x®)P" - 2xP' -1- m(m + 1)P = 0, ' = — • (13-7) 

dx 

When m is a nonnegative integer, a solution of (13-7) is the Legendre 
polynomial Pm{x) = Pm(eo8d). Thus, one is led to consider solutions 
of (13-3) which have the form 

r^Pm{<^osO) or r"“^”*'^^^P„(cos0). 

The second of these expressions is rejected because it becomes infinite 
as r —> 0, and we attempt to build up the desired solution u by forming a 
series 

00 

w = Z) (cos e). (13-8) 

tnaaO 

Each term of this series satisfies (13-3). 

When r « 1, Eq. (13-8) becomes 
00 

« = 53 AmPm (cosff), r » 1, (13-9) 

and if it is possible to choose the constants in such a way that (13-9) 
satisfies the boundary condition (13-4), then (13-8) will be a solution of 
the problem. Since x = cos the boundary condition requires 

00 

F{x) = 23 d„P„(x), (13-10) 

where F{x) « 0 for —1 < x < 0, and F{x) = 1 for 0 < x < 1. Now, it 
was stated in Chap. 2, Sec. 22, that tlie expansion (13-10) is possible for 
suitably restricted functions F{x) and that the coefficients are gi 

d* »= -H P(x)P«(x) dx. 
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By means ol this formula, the solution is found to be 

« - M + HrPiicos 0 ) - Hif^Psicos 0 ) + iH2r®P8(cos «)-•••. 

It is possible to establish that (13-S) is actually a solution, though the demonstration 
requires a detailed knowledge of Legendre functions.^ The uniqueness theorem es¬ 
tablished in Sec. 24 shows that there is no other solution, and hence the foregoing 
procedure can be justihed. In particular it was permissible to take m as a nonnegative 
integer and to use the polynomial solution of (13-7) rather than one of the infinite-series 
solutions. 


PROBLEMS FOR REVIEW 

1. As an infinite series, express the steady-state temperature in a circular plate of 
radius a which has one half of its circumference at 0*^0 and the other half at 100*^0. 

8* By (12-13), find the temperature of the plate considered in Prob. 1. 

8. By separating variables in polar coordinates find the steadynstate temperature in 
a semicircular plate of radius a if the bounding diameter is kept at the temperature 
0*^0 and the circumference is kept at the temperature 100'’C. 

4 . Interpret the following Dirichlet problem physically, and solve; 

u** -huyy » 0, 0<x<l, 0<y<l, 

u(0,v) •» - u(a:,0) - 0, - /(x). 

8. Derive (13^) from (13^). 

14. The RectanguiRT Membrane. Double Fotirier Series. Let a uni¬ 
form elastic membrane be stretched 
over a fixed, plane, bounding curve 
(Fig. 17). To explain what is meant 
by the tension ^ we consider the force 
AF exerted by the membrane on one 
side of a small straight slit of length 
As. The membrane is said to be 
under uniform tension T if this force 
is directed perpendicular to the slit in 
the plane of the membrane and has 
magnitude T As independent of the 
location and orientation of the slit. 
A similar definition applies when 
the membrane does not lie in a plane except that we must let As 0: 



A# '“t 0 As 


The role taken by the plane of the membrane in the first case is now taken 
by the tangent plane at the point in question. 

^ Due must show tl^at the series obtained by differentiating (13-8) are uniformly con¬ 
vergent for r < 1 — S And that the boundaty condition is verified as r 1. 
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Let the coordinate system be so chosen that the bounding curve of the 
membrane lies in the xy plane. The vertical displacement of any point in 
the membrane at time t is denoted by u » To obtaun a dif¬ 

ferential equation for the motion, we consider a small, nearly square portion 
of the membrane bounded by vertical planes through the points 


(a:,y,0), {x + Ax, y, 0), (x, y + Ay, 0), (x + Ax, y + Ay, 0) 

(see Fig. 18). Applying Newton's law to the small portion gives tihe ap¬ 
proximate equation 

T 

Utt » y^(Uxx + Uyy), ^2 ^ (14-1) 

P 


where p is the surface density. This equation describes smaU oscillations 
of the freely vibrating membrane. Its derivation is similar to the cor¬ 
responding derivation for a vibrating string (Sec. 2). 



The problem of the vibrating membrane is solved when we have found 
the solution of (14-1) which satisfies appropriate initial and boundary 
conditions. We shall now consider the case of a clamped rectangular 
membrane witli sides of lengths a and h (Fig, 19). The boundary conditions 

u » 0 for X « 0 and for x == a, 0 < y <h, 

(14-2) 

u ~ 0 for y «= 0 and for y « 6, 0 < x < o. 



To determine the solution uniquely we also specify the initial displacement 
and initial velocity; 

«(x,y,0) - fix,y), utix,y,0) •= gix,y). (14-3) 
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The assumption that 
in (14-1) yields 


« X{x)Y(y)T{t) 



(14-4) 


upon division by XYT. Since the variables are separated, the terms in 
(14-4) are constant. It can be shown that these constants are negative, 
SO that we may write 

X" 2 F' , r" , 

X Y ^ » jr » 

with Y®(p® -f ^) « by (14-4). 

Since X" -f- p^X « 0, the function X(x) is a linear combination of 
sin px and cos px. The cosine is rejected because the condition u ~ 0 at 
a? » 0 gives X(0) = 0, and we must have p = wir/o, where ?n is an integer, 
because the condition u *= 0 at x « a gives X{a) == 0. In just the same 
way it is found that 

Y ~ sin gy, 

^ere q » tnr/b for an integer n. Thus, the desired oscillation has the form 


mrx nwy 

sin-sin-(4 cos + B sin a)mnO» 

a b 

inime A and B are constant and where Wmn = « is given by 



(14-5) 


The functions (14-5) satisfy the differential equation and the boundary 
condition. To satisfy the initial conditions (14-3) we try a superposition, 
using different constants A and B for each choice of m and n: 


^ mTX mry 

* 2^ (Amn cos 0)^nt + B^n Sm (Ornnt) sin-siu “* (14-6) 

ni.n>«l ^ b 


Since the initial displacement isfix.y), we must determine so that 


„ rmrx nrry 

2^ jl„„8in-sin-—- 

mifiMi o b 


Multiplying this double Fourier series by sin (wirx/a) sin {uTry/b) and inte- 
Ifratmg over the rectangle give the formula 


4 


■* 


mirx 
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just as in the carrespcsidizig discussicm for single Fourier series (Chap. 2, 
Sec. 18). Similarly, differentiating (-14-6) with respect to t and setting 

t vm 0 give 

fimn « ~r- / / - mn — dxdy 

dbu>mn a b 


when we use the second initial condition (14-3). 

The general term of the series (14-6) is a periodic function of time with 
period 2ir/‘*>»nn. The corresponding frequencies 


Wfnn 



cps 


(14-7) 


are called characteristic frequencies, and the associated oscillations (14-5) 
are called modes. The fundamental mode is the mode of lowest frequency, 
obtained by setting m = n = 1. 

Similar terminology applies to the vibrating string (Secs. 2-^). If the 
length of the string is a and the equation of motion is 

Uu « y^u^xt 


the characteristic frequencies may be written in the form 



(14-8) 


analogous to (14-7). The modes are described by 


mrz 

It «= sin-(-4 cos o)mt + B sin Q)mt) 

a 


and the fundamental is the mode obtained for m = 1. The three-di¬ 
mensional analogue of (14-7) and (14-8) is discussed in Prob. 2. 

In Sec. 8 it was shown for the vibrating string that the characteristic fre¬ 
quencies agree with the resonant frequencies, and a similar behavior is 
found for vibration phenomena in general. It is also true in general that 
the vibration can be expressed as a superposition of individual modes. 
This fact is illustrated by (14-6) and by the Fourier-series solution for the 
vibrating string. 

The behavior of the vibrating membrane differs from that of the string in one respect. 
For each characteristic frequency of vibration of the string the corresponding mode is 
such that the string is divided into equal parts by the nodes whose positions are fixed. 
Whan a membrane oscillates with a given characteristic frequency, there are also points 
on the membrane which remain at rest. Such points form nodal Hrm, The position 
and the shape of the nodal lines, however, need not be the same for a given frequency. 
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Afl an lustration consider a rectangular membrane with a •> &. The frequency equa¬ 
tion (14-*7) then yields 

umn «* — Vm* + n* 
a 

w aVm* 4“ n* a » (14-9) 

a 

For m » n M 1, we get from (14-6) the fundamental mode 

Uu ■■ (All cos 4* Bn »m «jiO sm — sin —» 

a a 

where «ii • aV^, Since un « 0 for all t only when a; « 0, y « 0, a; « a, y « a, there 
are no nodal lines in the interior of the membrane for this frequency. If we take m *« 1, 
n »■ 2 and m »■ 2, n » I, we get two modes: 

« . . . . 2ry 

* (All cos «ii< + Bit sm ci>ttt) sin — sin-» 

a a 


2tx Try 

tin — (All cos <a 2 it 4- Bii sin a» 2 i 0 sin — sin —t 

a a 

with the same frequency, since t.>ji « wu *' a\/5. For y * a/2, Ui 2 « 0 and for z • 
a/8, till » 0. These nodal lines are sliown in Fig. 20. By forming linear combinations of 
the modes in (14-10) we can get oscillations with the same frequency but with different 
nodal lines. Thus, if we take An « An « 0 and form uu 4- 'Uti, we get 




fti*a 715 


Qfma -/TB 
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ttjj 4* tiji ■» flin wu^ (Bjt aia — siii + fti ran — eln 

\ a a a a/ 

■■ (sin o»xaf)2 sm — sm — I Bn cos-h Bji cos — I • 

a a \ a a / 

For this oscillation the nodal lines in the interior of the membrane are determined by 

Bn cos-h Ba cos — « 0. (14-11) 

a a 

Equation (14-11) for Bn « Bai yields the nodal line x -f y « a and for Bn * —B%i, 
the line x — y ««■ 0 (see Fig. 20). Different nodal lines can be obtained by forming dif¬ 
ferent linear combinations of the modes (14-10). 

The reader will show that for m »* n 2, all oscillations have the same nodal lines 
X » o/2, y « a/2, while infinitely many different nodal Unes can be obtained by form¬ 
ing different linear combinations of the modes uu and usi. A few of these are shown in 
Fig. 20. 

Since the nodal lines may be regarded as the boundaries of new membranes contained 
in the original one, the character of oscillation of membranes of different shapes can be 
deduced from the examination of nodal lines (see Prob. 3). 

Nodal lines can be observed experimentally by sprinkling a fine powder on the vibrat¬ 
ing membrane. 

PROBLEMS 


1. Suppose the initial conditions for the rectangular membrane considered in the 
text are 

■* 0.1 sin — sin tx<(x,y,0) « 0. 

a o 


(a) What is the frequency of the oscillation? (b) What is the maximum opeed attained 
by the mid-point of the membrane? 

2 . Analysis of a microwave resonant cavity leads to the equation 

utt * r*(wxr + Uw 4- u,») 

with the boundary condition u « 0 or du/dn ** 0 on suitable portions of the planes 
x»0, x*«a, y®"0, y ^ b, z«0, z«c 


(see Fig. 21). By assuming u ** XYZT show that the 
characteristic frequencies are 


Wmnp 

1.7 


0 T 


where m, n, and p are integers. 

3. A curve in the xy plane along which w * 0 for all f is 
called a nodal line, (a) Sketch the nodal lines for the oscill¬ 
ation (14-5). (6) Sketch the nodal line for the oscillation 


( . rx . 2wy . 2irx , ry\ . , o • 

am — sm -r—h sm — sm -7-) (A cos taut 4- B ran wjiO 
a 6 a b / 



Fio, 21 


which arises by adding the modes w -■ 1, n 2 and m ■* 2, n » 1. Hint: sin 29 
2 sin 9 cos 9. (c) Thus obtain one solution for the problem of a triangular membrane. 
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U. the Circular Membrane, Bessd Functions. To discuiss the oscil¬ 
lations of a circular membrane with fixed edges we introduce cylindrical 
coordinates With 

« lUxfi^) (15-1) 

the equation of motion (14-1) takes the form (cf. Sec. 12, Prob. 2) 

Uit « y^Urr + T-^Ur + (15-2) 

If the boundary is the circle r ~ a, then the boundary condition is 


V{afify = 0. (15-3) 

To make the problem definite we also introduce initial conditions 

Virfifi) = /(r), Viirm = 0 (15-4) 


which state, respectively, that the initial shape of the membrane is given 
by /(r) and that the initial velocity is zero. 

Since the initial sliape is independent of the solution presumably in¬ 
volves r and t only. Thus, we consider expressions of the form R{r)T{t) 
when applying the separation method. Substituting into (15-2) gives 


Id^T ^ ^nd^R 1 dR\ 

tIF ~ KRUF^TRltr) 


(15-5) 


after division by RT, Since the left-hand member of (15-5) depends on t 
alone and the rightrhand member on r alone, each side must be constant. 
It can be shown that the constant is not positive, hence may be written 
as — 0 )^. Thus (15-5) leads to 


+ 

II 

o 

/_^ 

dt 

(15-0) 

B" + r-'jB' -f k^R = 0, 

d 

dr 

(15-7) 


where fc - w/y. 

Equation (IS-fi'l is the familiar equation for simple hannonic motion, 
and Eq. (15-7) can be reduced to the Bessel equation by the substitution 
z « kr. Hence. (15-7) has a solution 

R *Iq(jz) </o(^0* 

The other solutions of (15-7) are rejected because they become infinite 
at r » 0, and we are led to the functions 

Jo{kr) sin cot or /o(^^) cos (at 

Since » 0 when f » 0, we reject the solution involving the sine. The 
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boundary condition (15-3) applied to our elementary solution RT now 

JoQca) cos wi »= 0 


for all t This requires that ka be a root of the equation Jo{x) » 0 (see 
Fig. 22 ). If the positive roots of Jo{x) are denoted by Xn, the appropriate 



choices of k are given by 
tions have the form 


kn — Xn/a, Since co = ky, our elementary solu- 
RT « Joikn^) cos knyt 


These functions satisfy the differential equation (15-2), the boundary 
condition (15-3), and the second initial condition (15-4). To satisfy the 
first initial condition we try to represent (7 as a linear combination of such 
terms: » 

^ ~ 2 ^nJoiKr) cos knyt. (15-8) 

n»«l 

When i » 0, the initial condition requires that 

00 

fir) = Z A„JoiKr). 

The problem of expanding an arbitrary function in series of Bessel functions 
was discussed in Chap. 2 , Sec. 22 . It was shown that the coefficients are 
given by 2 f« 

(IM) 

provided the series is uniformly convergent (but see also Chap. 2 , Sec. 23). 

In the terminology of the preceding section, the solution (15-8) is ex¬ 
pressed by means of the modes. The characteristic frequencies are 

Ky ^ ^ny 

2r 2r 2va 

and the fundamental is described by JoihT) cos * 17 #. 
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PROBLEMS 

1. The oscillations of a cylindrical resonant cavity satisfy 

uit « y^(ux* *f 0<r<o, 0<«<6 

with boundary condition u » 0 on the curved surface, u, «« 0 on the plane ends. Obtain 
solutions of the form R(r)Z(z)T(t) for this problem. 

2. Find the distribution of temperature in a long cylinder whose surface is kept at 
the constant temperature zero and whose initial temperature in the interior is unity. 

3. An elastic membrane subject to uniform gas pressure satisfies the equation 

ut< -h p « -b %/)» 

where p is a constant depending on the pressure. If the membrane is circular, show 
how to reduce this problem to a problem of the type solved in the text. Hint' Consider 
the function 

U{rAt) ~ u - ^^(7^ - a^). 


SOLDTIOW BY INTEGRALS 

16. The Fourier Transfonn. For many partial differential equations 
the desired solution can be expressed as an integral involving the initial 
or boundary values. This possibility was already illustrated by formula 
(3-10) for displacement of a vibrating string and by the solution of the 
Dirichlet problem given in (12-13). We shall now describe a systematic 
method of obtaining integral formulas. 

The function g{s) defined by 

1 ra 

T/ *= lim ■ ;- •• = = / e *^*/(x) dx = g{s) (10-1) 

Q -^ lOI, 

is called the Fourier transform of f{x) ; the operator T is called the Fourier 
transform operator. The inverse operator is obtained by changing 
the sign of i, so that the foregoing equation may also be written 

1 /•« . 

T ‘g “ lim —y— / e“*g(s) ds = /(x). (16-2) 

o -♦ « \/2ir *'—“ 

When such is the case, the symbol T satisfies the easily remembered equa¬ 
tions 

TT-y - f, T-^Tf - /. (16-3) 

If the limits in (16-1) and (16-2) are regarded in the sense of mean convergence (Chap. 
2, Sec. 23), and if the integrals are regarded as Lebesgue integrals (Appendix C), then 
(16-1) gives (16-2) and (16-2) gives (16-1) provided either of the integrals 
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is finite.* Both integrals (16^) then have the same value. In many physical problems 
the common value represents the total power or energy present in the system. 

To illustrate the use of the Fourier transform, we shall solve the problem 


W, = 0?Uxx, 

f > 0, —00 < X < 00, 

(16-5) 

it 

o 

— 00 < X < 00, 

(16-6) 

u{xfy —+ 0, 

as i » 00 . 

(16-7) 


Physically, this system describes the temperature u{x^i) of an infinitely 
long bar at point x and time t when the initial temperature u{xfi) is known. 
The trial solution u = with p and q constant leads to 

when substituted into (16~5). Hence q = and the trial solution is 

^VX-\-a^ph 

We choose negative because of (16-7). Thus p = ts, where s is real, 
and the trial solution is now 

(16-8) 

We shall satisfy the initial condition (16-6) by forming a linear combina¬ 
tion ’ of solutions (16-8). Thus 

y/2v 

is a solution of (16-5) no matter what value gis) may have, and the integral 
u{x,t) = ds 

V2ir 00 

is also a solution, provided we can differentiate under the integral sign. 
By (16-2) the latter expression can be vTitten 

u{x,i) = (16-9) 

Setting t = 0 and using the initial condition (16-6) give 

/(x) = T-V(a), 

* This important theorem, known as PUmchereVs theorem, is proved in E. C. Titch- 
marsh, ‘Tntroduction to the Theory of Fourier Integrals/' Chap. 3, Oxford University 
Press, London, 1937. For a heuristic discussion of the relation between (16-1) and 
(16-2) see Chap. 2, Secs. 20 and 21, of the present text, 

*Thi« procedure is analogous to the formation of Fourier series in the method of 
separating variables. 
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SO tliat tf « TT"“V *" 9* Substituting into (16*9) we get the final answer, 

u(x,t) ^ (16*10) 

This is an explicit formula for the temperature u{Xyt) in terms of the 
initial temperature/(x). 

As another example we shall solve the Dirichlet problem for a half 
plane. Several physical interpretations were given in Sec. 12 ; the mathe¬ 
matical formulation is 

Uxx + Uyy = 0 , y> 0 , -00 < X < 00 , (16-11) 

u(xfi) « f{x)f —00 < X < 00, (16-12) 

u(x^y) —^0, as y —* oo. (16-13) 

The function satisfies (16-11) if — 0 . We choose q real 

and negative because of (16-13), and hence p is pure imaginary, p = is. 
The trial solution is now 

l«ly 

^ } 

when we note that and that g is negative. This function satisfies 

(16-11) and (16-13). To satisfy (16-12) we form a linear combination as 
in the previous example, thus: 

1 

M(x,y) = -7= / ds s 

*\/27r 

Setting p — 0 we get / — T“V by (16-12). Hence g = T/, and the solu¬ 
tion ^ is 

u{x,y) - (16-14) 

As a final example we shall consider the problem 

utt = u(XyO) = /(x), ut(x,0) == g(x) 

which describes weaves on an infinite string (cf. Secs. 2 and 3). The trial 
solution e*'*'^^* yields the two expressions 

Forming a general linear combination, we get 

u{x,t) - (16-15) 

where gi and §2 are to be determined from the initial conditions. 

If (16-15) can be differentiated under the integrals wliicJi are implied 

in the result is 

ut{x,t) « T^^iase^^giis) — (16-16) 

‘ Formulas of the type (16-10) and (16-14) are discussed in E. M. Eedheffer, Operators 
and Initial-value Problems, Proc, Am, MiUh, Soc,, 4 (August, 1953)* 
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Setting < 0 in (16-16) and (16-16) now gives, respectively, 

/ = T-'ffi + T~*^at 9 “■ 
or, after operating on the equations with T, 


9i + 92 

Solving for gi and g 2 , 

1 


T/, tos(^i - Pa) = Tg. 


9i 


^Tf + ^Tg, 
2 2icis 


1 1 
= - T/ - — Tg, 
2 2tas 


(16-17) 


and this gives the final answer upon substitution into (16-15). 

The foregoing result can be deduced from d’Alembert’s formula (3-15). 
However, the method of Fourier transforms also applies when d’Alembert’s 
method fails (cf. Probs. 1 and 2). 


PROBLEMS 


1. According to Sec. 7 the equation for damped motion of waves on a string is 

Uit — a^Uxs “ — 26 u<. 

Obtain a family of solutions of the type 

u(x,l) « V#) + 

by starting with u ** c***"^*^ and forming a Imear combination. 

2. Formulate appropriate initial conditions for Prob. 1. and use them to determine 
and g 2 - 

3. The displacement u(x,t) of a long, stiff rod satisfies 

^ - MOf f « force, 

when the mass is negligible (cf. Sec. 2, Prob. 6). Let U(s,t) be the transform of u with 
riispect to the variable x, and let F(8,t) tie the transform of /. Neglecting convergence 
questions, show that Eh^U »» F, and thus obtain the solution in the form 

u(x,0 « 


Hint: Write out the expressions 

u{x,i) - T~^17(s,0, 


/(x,0 « 


in full, and substitute into the differential equation. 

4. If the mass of the rod in the preceding example is m, the equation of motion is 


El -r*^ 4" - 


dx* 


df 


*/(x, 0 . 


Show that u « T~^r7, where U -» U(s,t) satisfies the ordinary differential equation 





486 TmrviL piffebbntui;i equations (ckap. 6 

17, Waves in a Half Hane* The Fourier transform can be used to solve 
the twoHiimensional wave equation 

y^(uxx + Uyy) « uu, 7 =“ const, (17-1) 

and the result has an interesting physical interpretation. We suppose 
that the time dependence is harmonic, so that 

uix,y,l) = (17-2) 

where <*> is constant. Substituting in (17-1) gives the scalar wave equation 

Uxx+Uyy + J^U ^0, (17-3) 

7 

This equation will now be solved in the half plane i/ > 0 subject to the 
additional conditions 

U{xfi) « fix), Uix,y) -^0 as 2 / oo. (17-4) 

Physically, the solution describes the radiation field of an antenna ^ when 
the aperture illumination is fix) (see Fig. 23). 



By substituting the function into (17-3) we obtain solutions 

(17-5) 

Because of the second condition (17-4) the coeflicient of y in (17-5) has a 
negative real part when s is large; we shall indicate this by dropping the 

^ Formulation of (17-3) in this context and discussion of the conditions for a unique 
solution can be fotmd in treatises on electromagnetic theory. 
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minus sign. Finming a linear combination ot expressions (17>5) as in the 
preceding Section, 

Uix,y) - -4= r ds » 

For y « 0 the first conditicHi (17-4) yields g » Tf, and hence 

U(x,y) - (17-6) 

Multiplying by as in (17-2), we get the corresponding solution of 
(17-1). 

The discussion of (17-1) given here contrasts to that given in connection with the 
vibrating membrane (Sec, 14). For the membrane we specified the initial values of 
u and ut, and we obtained a series involving infinitely many oscillation frequencies. 
Here, on the contrary, the frequency w/2r was prescribed in advance. By (17-2) the 
initial conditions are 

ulx.yfl) - V{x,y), ui(x,y,0) « -t«f^(x,y) 

and the first condition cannot be specified arbitrarily, inasmuch as U(x^y) satisfies (17-3). 

To interpret the solution physically, we have 
u(x,y,t) =• 

by combining (17-2) and (17-6), with T/ = g{»). Writing out jo full, 
uix,y,t) - ^ /_^ ds. (17-7) 

For simplicity we shall suppose that g($) » 0 when \s\> k. The limits 
( — 00 ^ 00 ) of the integral can then be replaced by (~A:,fc). If we now in¬ 
troduce a variable 

a « ife sin ~ ^ « fc cos (17-8) 

we get 

u{x,y,t) = f «“** *+*» •“ sin e)k cos d dB. 

V2ir •'—«■/» 

This formula expresses the solution as a superposition of functions of the 
form 

^i{kx sin 9-^kv cos 9—vt) (17-9) 

In the next paragraph it will be seen that (17-9) represents a plane wave 
traveling with velocity y in the direction 0 (Fig. 23). Hence, the Fourier 
transform procedure gives the plane-wave expansion of the antenna fields 
The amplitude of the wave moving in a given direction $ is 

g(k sin ^) A cos & dS, 

where p(s) is the Fourier transform of the aperture illumination. 
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To see that (17-9) rejpreoeote a plane wave we examine the (x,y) locus on which (17-9) 
is constant. Without essential loss of generality we confine our attention, in particular, 
to the locus along which the exponent is zero. The equation of this locus is 

«sinS 4- ^ cos d •» (17-10) 

when we divide by k and replace k by its value (17-3). 

The wave front (17-10) is clearly a straight line, and the wave fronts for different 
is are all parallel. In fact, their common perpendicular makers an angle $ with the y 
axis. Since the distance from the line (17-10) to the origin is yt, the vtdocity of propaga¬ 
tion is the constant 7 , and hence the desired result is established. It is possible to give 
a similar discui^ion for the part of the integral (17-7) \\ith U’i > k, though we shall not 
do so here. 

18. The Convolution Theorem. The convolution f * g oi two functiona 
/ and g is defined by 

/♦ p = lim / /({)p(a: - {) ^ j f(^)g{x - {) df (18-1) 

if the limit exists either in the ordinary sense or in the sense of mean con¬ 
vergence. The importance of the operation (18-1) rests on the following 
theorem: 

Convolution Theorem. Let /, |/j^ and \q\^ be integrable^ and let 

all infinite integrals be interpreted in the sense of mean convergence. Then 
the product of the transforms eqiuds the transform of the convolution. In 
symbols, 

- mUg), (18-2) 

Although a complete proof requires knowledge of Lebesgue integration and mean 
convergence, the result can be made plausible as follows We have 

T(/. /__/(!«. - 0 Js] d. 

provided the order of integration may be inverted. If the variable x in the inner integral 
is changed to f « x - we get 

k £ [ £•<'>■-''*"• ■“] «>«- {yr. £'“>■■"■ ■«) £'“>'■"■ ■“)• 

and this is (T/)(T^). 

By means of the convolution theorem some of the foregoing results 
cam be greatly simplified. Taking the transform of the formula (16-10) 
with respect to x gives 

Tu <= 


(18-3) 



t 
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for the temperature u(x,0 of a rod when the initial temperature is f(x). 
By consulting a table of Fourier transforms or by usmg the result of ]^ob. 
1 , 

g-aVi „ ^ (2a2<)-He-**/«**<>. 

Hence, the result (18-3) may be written 


Tu =. (T/)(Tp) ~T(f*g) 

when we recall (18-2). Taking the inverse transform now yields 

u(x,t) —/* p =» (4xa*t)~^ f (18-4) 

«0 


The advantage of this formula is that it involves only a single integration 
whereas (16-10) requires two integrations. Since the integral is rapidly 
convergent when t is not too large, (18-4) is well suited for numerical 
computation. 

To obtain a physical interpretation of (18-4), let the rod have the initial 
temperature zero except for a short piece on the interval (xq — c, xq + «) 
(see Fig. 24). If Q cal of heat is uni¬ 
formly distributed over this element j j 

of the rod, the corresponding initial r.......-- 1 

temperature / is given by *o 

Q =» 2€Cpfj Xo — €<x<xo + « ^ 

where c is the heat capacity and p tlie linear density. By (18-4) the resulting 
temperature at point x and time t is 


u{x,t) 


- - - i 

Cp{4TrOlH) ^ 2t •'aco—f 


Letting « 0 and using the mean-value theorem we get 

Cp{4va^t) ^ 


(18-6) 


This gives the temperature distribution for an instantaneous source of 
strength Q at the point xq. Now, Eq, (18-4) represents the temperature in 
the general case as a superposition of such sources. The source at x f 
has the strength 

Q « epfi^) 


Am we shall see in the next section, this physical interpretation enables us 
to solve a variety of problems in heat flow with the greatest ease. 
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Maampk: Let tt^x^y) be harmonio for y > 0 and satisfy the additional condtUons 
u(xfi} — /(x), u(x,y) -♦ 0 as y 
Show that tt w given by the Poiason formula for a half plane: 

u(x.v) - ? r -(18^) 

r (x - 0* + y* 

Sinoe this problem is the same as that in (16-11) to (16-13), the solution is given by 
(16-14). Taking the transform of (16-14), 

Ttt « (18-7) 

The convolution theorem can be applied if we express c'“****' as a Fourier transform. 
To this end we compute the inverse transform 


V2t Jo \/2v 

\/2t Vy — ix y -f tx/ 


( 18 - 6 ) 


This shows that «■ Ty, where g is the function (18-8): 

1 2y 
!* + !/»■ 

The convolution theorem applied to (18-7) now gives u ^ f * g, and that is the desired 
result. 


PROBLEMS 

1. Let I(x) where c is constant, (a) Differentiate, and integrate the 

result by parts to obtain 

dx 2c 

(b) Using Eq. (10-1), Chap. 8, find the value of 7(0). (c) Thus deduce the formula 

« (2r)~^e~**/^®. 

(In particular, and c'*'**^* are transforms of each other.) 

3. Obtain the temperature distribution for a rod extending from x «• 0 to x « 

if the initial temperature is f(x) and the end x *» 0 is insulated. Ilini: Consider a rod 
extending from — oo to «, with initial temperature fo(x) defined by 

fo(x) even, fo(x) « f(x) for x > 0. 

Compare Prob. 5, Sec. 10. 

8. By taking fo(x) odd in Prob. 2, find the temperature distribution when the end 
X « 0 is not insulated but is kept at the temperature aero. 

4. A rod extending from x * 0 to x «• 1 has the initial temperature distribution /(x). 
By regarding this rod as part of an infinite rod with the initial temperature /o(x), find the 
temperature u(x,t) when (a) both ends are insulated, (b) both ends are kept at the tem¬ 
perature sero. jffint: Let fo(x) have period 21. This method of satiafying boundaiy 
conditions was used for the vibratingnstring problem in Sec. 6. 
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19« The Source Functions for Heat Flow. According to (18*5) the 
function 






(19-1) 


represents the temperature distribution due to an instantaneous source 
of strength Q at the origin. Equation (19-1) applies to the one-dimensional 
heat equation «= Ut^ The corresponding result for two dimensions 
is 

__0 r* - *» + (19-2) 

and for three dimensions it is 


^ g-r*/4a»l 

pc{AiraH) ^ * 


+ 2 /^ + 


(19*3) 


In these formulas r is the distance from the source to the 'point of observation 
and i is the length of time that has elapsed since the heat was released. The 
value of p is, respectively, the linear, surface, or volume density. 


The functions (19-1) to (19-3) are solutions, respectively, of 

a^Uxx “ Wt, a\u„ -f ityy) - Ut, a^Utx -f -f «*») • 

Also they give the limit 0 as ( 0 through positive values, provided r 0. Hence 

the initial temperature distribution is concentrated entirely at the origin. By integrating 
over the whole space it can be shown in each case that the total amount of heat present 
is Q when t > 0, For these reasons, the physical interpretation as a point source oj 
strength Q is fully justified. 

The expressions (19-1) to (19-3) indicate that heat travels with infinite speed. Even 
if r is large, we get a positive temperature for each positive no matter how small, but 
the initial temperature was zero. By contrast, the disturbance associated with the 
wave equation travels with finite speed (cf. Secs. 2 and 4.) 

To illustrate the use of (19-3) let us find u{x,y,z,t) when the initial 
temperature 

u{xi,yi^i,0) = f{xi,yi,zi) 


is given at each point ixi,yi,zi) of space. Instead of this distribution we 
introduce a source of strength 

<3 = cpfixi,yi,Zi) dxi dyi dzi (19-4) 

at {xi,yi,zi). The temperature at point (x,y,z) and time t due to one such 
source is given by (19-3), with Q as in (19-4) and with r the disttmce from 
{x,y,z) to the source: 

r* - (*1 - a:)® + (jfi - y)® + (zi - s?. 
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Hie temperature at (x,y,z) due to aU the sources is given by superposition: 
«(x,y,z,0 = (4ira®0~^f f f dxidyidzi. (19-5) 

J —00 j —oo j —00 

As another illustration we shall find the temperature due to a point 
source which emits heat continually. Let Q{t) represent the strength of 
the source, so that the amount of heat emitted in time interval (<i, h + dti) 
is approximately Qih) dti. The heat at the present time t due to the source 
at time ti is \ 

pc[4ira£^(^ — 

when we recall that i in (19-3) stands, not for the time, but for the elapsed 
time. Adding the contributions from the source at all values of ti prior 
to t gives 


u 


ac 


p-r*/[4a*0- 


pc [4^ra^(^ — ti)]^ 


dh 


(19-6) 


If Q(t) is a constant Q, the integral can be evaluated explicitly by the change of 
variable 


die result is 


u - 


4a*(t - to ‘ 

Q 1 

4ira^pc r 


(19-7) 


This represents the temperature due to a continuous uniform source of heat at a distance 
f from the point of observation. Since the conditions are steady state, the solution 
satisfies Laplace's equation. (Compare Sec. 12, where the function 1/r was obtained 
in connection with electrostatics and gravitation.) 

Exampk: A line contact is pressed against the 
plane x » 0 with constant normal force F per 
unit length, the coefficient of friction being a 
constant p. At time 1 « 0 it starts to elide in 
a direction jierpendicular to its length with con¬ 
stant velocity v (see Fig. 25). Obtain the tem¬ 
perature in the medium x < 0, assuming this 
temperature to have been zero initially and 
neglecting heat loss at the surface x « 0. 

This problem arises in the theory of milling, 
leather glazing, and lathe turning. To solve it, 
let the line contact be initially coincident with 
the z axis, so that its height at time is v 
The heat generated by friction per unit length 
is Ffi dy, and hence the heat generated per unit 

Q ■" Fyvdti. 

Using this value of Q in the result (19-2) we obtain 



-/ 4 «i(l-L) 

ipcc^i-to 
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for tlie contribution^ at the point (x^y^x) and at the present time t, due to motion of the 
line contact in the time interval ((i, h -f dti), {Hie reader is reminded that t in (19^2) 
stands for the elapsed time and -4* in (19-2) is the square of the distance from the 
point of observation to the line source.) Superposition yields the final answer: 

Ffiv rt 


PROBLEMS 


1. Show that (19-2) can be obtained by integrating (19-3) with respect to z and (19-1) 
by integrating (19-2) with resfMJct to y. Interpret physically. 

2. What initial- or boundary-value problem is solved by (19-6)? By (19-6)? 

8. By use of (19-2) solve the initial-value problem 

a^(Mx» -^Uyv) u(x,yfi) « f(x,y), 

4 . Find the temperature distribution u(x,y,z^i) for x > 0 due to a time-dependenf 
distribution /(i/,r,0 on the plane x ■■ 0. Take the initial temperature as zero for x > 0 

5. State and solve the two-dimensional analogue of Prob. 4; the one-dimensional 
analogue. 


20. A Singular Integral. We shall now derive an integral formula which 
can be used in the study of many partial differential equations. The 
discussion depends on certain theorems of vector analysis ^ s ummar ized 
in the following paragraph. 

In the divergence theorem 

j (V*A) dr » j kn d<r, 


dr ^ volume element 
dc = surface element 


( 20 - 1 ) 


the choice A ** uVv yields Greenes first identity 


f o r dv 

I [uV^t/ -f- (Vv • Vt;)] dr — u — d<T 

Jr Jff dn 


(20-2) 


when we recall that (Vi>)n = dv/dn^ the normal derivative. Writing 
(20-2) with u and v interchanged and subtracting give Green's sy^mmetric 
identity 

I {uVh — vV^u) dr ^ I \u - V — ] do'. (20-3) 

-'r Jer \ dn dn/ 


The conditions for validity of these identities are discussed in Chap. 6. 
For our present purposes we need an appropriate form of (20-3) when 
V does not satisfy the continuity conditions there required. 

* The reader may find it advisable to review Chap. 5, Secs. 8 to 10. Unless otherwise 
indicated, the functions considered in Secs. 20 to 22 are twice continuously differentiable 
in the rei^on r and on its boundary. The surface of r is assumed smooth, so that the 
normal is a oontinuous function of position. 



494 


PABTUL DIFFKRBNTUL EQUATIONS 


[OHAP. 6 


To tilis end, we diow tliat 

|4ir/(P), forc»2, 
a _ 0 •'»< o® lO, for c < 2, 


(20-4) 


where / is continuous, c is constant, and the region of integration ai is 
the surface of a sphere of radius a centered at P. Now, the integral (20-4) 
may be written 


o® •'»> a® ■'» 


KP) 


/(Q) -m 


c?(r « + Jj. 


Since the area of the sphere is 4irtt^, we have, as a 0, 


h 



m 


4xa^ —> 


j4x/(P), 

lo, 


for c = 2, 
for c < 2. 


Since a surface integral does not exceed the area of the surface times the 
maximum value of the integrand, we have 


h I < 4Ta'^-- 


<4irmaxl/(Q) •-/(P)|. 


If / is continuous, this tends to zero as a —► 0, and 
(20-4) follows. 

Let us now apply (20-3) to the function 
1 

t) « 1 /; 4 - r = r(P,Q), (20-5) 

r 

where w is twice continuously differentiable and 
where r is the distance from a fixed point P to tlie 
variable point of integration, Q. The region of 
integration is to be the region inside a given closed 
surface <r and outside a small sphere (tj of radius a 
centered at P (see Fig. 26). In this region r 0 
and (20-3) can be applied without hesitation. 

According to (20-5) we have 

Vh » ~ ( 20 - 6 ) 



On <ri the outward normal n is directed along the radius into the sphere, 
so that 

1 

dnr dr r 
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Since r » a on 0 ^ 1 , the foregoing equation and (20*5) give 
dv dw 1 1 

— SB -—I- !;««> + •"» on <ri. (20-7) 

dn dn a 

The surface integral in (20-3) can be written as an integral over 0 * plus 
an integral over crj. By inspection of (20-7) the integral over <ri is 

r r /dw l\ / l\dlil 

'Fhis becomes 4iru(P) as a 0, in view of (20-4). If we use this result 
and (20-6) in (20-3), we obtain the desired formula 

4tu(P) = f (uV^w — vV^u) dr + f (v - u — ) da (20-9) 

''r ’'9 \ dn dn/ 

upon letting a —♦ 0. When P is exterior to r, the same formula is valid, 
except that 4iru(P) must be replaced by 0. 

Since the volume integral in (20-9) is taken over the whole region t, it includes the 
point P at which v *■ «. The meaning of the integral is clear from the derivation, 
but we shall show directly that a singularity of the type 1 /r in a volume integral causes 
no convergence difficulties. If n is the interior of the sphere with surface <ri, we have 

/ ~ dr « f - 4irr* dr *■ 2ra*, 

Jri r Jo r 

This is clearly finite and in fact tends to ssero as o 0. 

21. The Poisson Equation. If u has continuous second derivatives, then 
the Laplacian V^u is a continuous function of position. We shall denote 
this function by — 4irp(x,j/,2), so that 

V^w s® —4 tp. (21-1) 

The choice f;«l/r, «?«0in (20-9) now yields the Poisson formula 

rp I f/I du a 1\ 

u(P) «/>dT + -/ (-_-~u_-)dcr (21-2) 

when we divide by 4ir. As before, r « r(P,Q) is the distance from P to 
the variable point of integration Q. The formula (21-2) holds for eveiy 
function having continuous second derivatives in t and on its boundary.^ 
We now change our viewpoint. Instead of starting with u and defining 
p by (21-1), we suppose that p is given in advance. Equation (21-1) 
is now a partial differential equation for the unknown function u; it is 

> Provided the boundary is ample enough to permit the use of (26*9). This condition 
on r is hereby postulated once for all. 
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cdled the Pm^mm equation. The foregoing considerations show that if n 
satisfies the Poisson equation, then u is given by the Poisson formula. 
The interest of the formula is that it yields the values of u throughout the 
interior of r in terms of u and du/dn on the surface only. 

For a physical interpretation, let u be the electrostatic potential due to a 
charge distribution of density p. The fact that tlie potential satisfies 
Poisson^s equation is established in treatises on electrostatics,* so that this 
interpretation is consistent with (21-1). Since g/r represents the potential 
due to a charge g at a distance r from the point of observation, the term 

1 

- (p dr), where r « r(P,Q) and p = p(Q), 
r 

represents the potential at P due to the charges within the volume element 
dr at Q. Hence the first term of (21-2), 

/ “ (p dr), 

Jr f 

represents the potential at P due to charges within the body r. Similarly 
the second term in (21-2), 



represents the potential at P due to a certain surface-charge distribution 
on the surface e. 


To interpret the term 

in (21-2), we consider the configuration shown in Fig. 27. Here, a charge —g is in¬ 
troduced at the point Q on the surface <r and a charge -f-g at a distance An along the 
outward normal n to or. The distance from —g to F is r, and the distance from q to P 
is denoted by ri. If we take q « m/An, where m is constant, the potential at F is 

1 1 1 A(l/r) a 1 

q --► m- 

ri r r An dnr 

as An 0. [That the limit is m d(l/r)/an follows from the definition of normal deriva¬ 
tive, without calculation.] The limiting configuration of Fig. 27 is called a dipole; 
the constant m is called the moment of the dipole. We have thus found the desired 
interpretation of (21-3); namely, (21-3) represents the potential due to a surface distribu- 

* The case p - 0 in (21-1) is discussed in Chap. 6, Sec. 14. A detailed analysis of the 
eondiUons under which Poisson’s equation holds may be found in 0. D. Kellogg, *Toun- 
dations of Potential Theory,” p. 166, Springer-Verlag OHG, Berlin, 1929. 
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Um df di{K)l6s lutvlikg tiu» amomtM A surfaoe distribution of d^ioloB mnsh 

as this is csaQsd a dmMe layer. 

Since the volume integral in (21-2) ie extended only over r, it doee 
not take account of the charges outside r. That purpose is served by tihte 
surface integral in (21-2), From this viewpoint (21-2) shows that Om 
charges mdside r can be replaced by a suitable surface charge and double 
layer on cr, without changing the potential within r. If r increases beyond 
all bounds, the limiting value of the surface integral can be thought to 
represent the influence of the charges at infinity. 




In many important problems there are no charges at infinity, so that the 
limiting value of the surface integral is asero. To investigate this possibility, 
let <r be a large sphere of radius a centered at the origin 0. By inspection 
of the differential triangle in Fig. 28, 


Ar An cos ^ An cos as An —> 0 

where ^ is the angle between OQ and PQ, The definition of normal deriva- 

tive leads to 

dr ^ Ar 

— lim — sa cos iff 
dn An 

(21-4) 

and hence 

d 1 1 dr cos^ 

dn r r^ dn r^ 

(21-5) 


If P is fixed and a —> it is easily seen that r ^ a uniformly with 
respect to the point Q on <r. Hence 1/r has the order of magnitude 1/a, 
and by (21-5) the normal derivative has the order of magnitude 1/a^. 
Now, the surface integral in (21-2) does not exceed 4ira* times the maximum 
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of the integrand. By the foregoing remarks, the integral therefore tends 
to aero as a --> «> if 

—^ 0 and max jw| —> 0, as a —> «, (21-^) 

In this case (21-2) leads to the simple formula 

w « f -dr, integrated over all space. (21-7) 

^ r 

Referred to spherical coordinates* (r,0,<^) the normal derivative du/dn 
in (21-6) is the radial derivative du/dr^ and o == r. Thus (21-6) is equiva¬ 
lent to 

du 

lim r — = 0, lim n * 0, uniformly in 6 and 0. (21-8) 

r ^ m dr r « 

By substituting (21-5) into the integral and regrouping terms one finds that (21-8) 
may be replaced by the weaker condition 

( 0u \ u 

f — -f li I «■ 0, lim - •» 0, uniformly in 6 and (21-9) 

dr / r-+oor 

That is, if u satisfies (21-1) and (21-9), then u can be represented in the form (21-7). 
When p *» 0 outside a bounded region, an altogether different procedure * shows that 
the seoond condition (21-8) also suffices. 

Example: If the region r is a sphere of radius r© centered at F, every solution of Pois¬ 
son’s equation satisfies 

u(P) - ^ jf u + jf (; - ^) /> dr. (21-10) 

Here r * r(P,Q) is the distance from P to the variable point of integration Q. To 
prove (21-10) we choose ii? ■» — 1/ro in (20-5) and note that on a 

^ dv _ 1 I 
' dn dr ** rj 

The desired result follows at once from (20-0). The special case p » 0 in (21-10) yields 
the Avkhaob-valub Theorem: If a function is harmonic throughoui a sphere, its value 
ai the center of the sphere equals the average of the values on the surface. This fact is of 
central importance in the study of harmonic functions. 

The merit of taking ip » — 1 /ro in (20-5) is that then s •» 0 on ir. Hence the term 
involving du/dn in the surface integral (20-9) drops out The possibility of making 
such a choice of v will be systematically exploited In Sec. 23. 

* The r in (21-8) has no relation to ttie r *■ fiP,Q) that appears elsewhere in this 
dmmmcm, 

•See H. B. Phillips, "Vector Analysis,” p. 168, John WOey St Sons, Inc., New York, 

im 


a max 


du 

dn 
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PROBLEMS 


1. If u ia hMmonlo, Acm that the choioe u r in CSO-2) gives 

j(ul+i^ + t^ dr - j^u^dr. 

fi. Show that a solution of Poisson’s equation in a closed bounded surface ^ is wholly 
determined by its boundary values and that it is determined, apart from an additive 
constant, by the boundary values of the normal derivative. Hint: If ui and us are two 
solutions, apply Prob. 1 to u ui us and then use the result of Prob. 4. 

8 . Let u be harmonic in a region r, and suppose u assumes its maximum value uo 
at an interior point P. Show that u ^ uq throughout every sphere contained in r and 
centered at P. Hint: If Af(u) denotes the mean value on the surface of such a sphere, 
then Uo •»* M(u) and henoe Af(uo — u) * 0 . Now use Prob. 4. 

4. Let fiQ) be continuous and nonnegative in a region r. U J f dr » 0, then/ m 0. 

Similarly for surface integrals. Hint: If / • € > 0 at an interior point P, then by 
continuity / > «/2 throughout some sphere n of radius 5 > 0 center^ at P. But Uiis 

gives j f dr > J /dr>jf (o/2) dr > 0 . 


22. The Helmholtz Fonnula. The Helmholtz eqmtion 

« 0 ( 22 - 1 ) 

is obtained by separating variables in the wave equation (Sec. 14) cr by 
requiring harmonic time dependence (Sec. 17). A brief calculation ^ shows 
that (22-1) has the solution where r is the distance from a fixed 

point P to a variable point Q, If we set 

6 ***' 1 

i; ass —, tp * r- (22-2) 

r r 

it follows that V^w » Vh «= for r 0. Hence 

uV^w — vV\ » ui—kh) —v{—k^u) = 0 , ( 22 - 3 ) 


provided r 5 *^ 0 and provided u satisfies (22-1). Substituting (22-2) into 
(20-9) with due regard to (22-3) now yields the Helmholtz formula 


m 


1 

4ir ' 


r Bn dn \ r / i 


da. 


(22-4) 


<»cpreaiW Ihe solutioc u of (22-1) as an integral involving the botmdary 
valim of u aqd Bu/Bn. 

Sometimes the region r is bounded and (22-1) holds at points exterior to 
r. To see if (22-4) remains valid in ttiis case, we construct a sphere n 


* Let the Laplaoiau be referred to spherical coordinates with origin at P. 
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centemi at the origin and having a radina & so large that r is contained 
entirely within ri (Fig. 29). Formula (22-4) applied to the region between 
t and the surface <ri of the sphere gives 


4ru{P) rm I - u — (— I Idcr + f — 

L r dn dn \ r / J r 


dn 



dor. 



In the first integral the outer normal for the region of integration is the 
inner normal for r. With this understanding we see that (22-4) holds in 
the present case, provided the integral over cri tends to zero as a 

To investigate the behavior as a —► note that on ai 

---«, — 1,^ — Icoe^ 

dn r dr r dn r \ r/ 


by (21-4). Hence the integral over vi becomes, after reBurangement, 



— iku 4" — cos f) -h ~ w cos ^ J d<r. 


(22-6) 


As a —► 00 with P fixed, we have a'^r. Also, the law of cosines applied to Fig. 2S 
shows that o*(l — cos ^) remains bounded. Hence, the integral will tend to 0 provided 


Umax 



0 and max | w | —► 0, as a «o. 


This assumes k real, so that le**"! 1. 

In spherical coordinates (r,0,^) the result of the foregoing discussion 
may be summarized as follows: Formula (22-4) applies to the exterior oj 
the bounded region r provided 


r 


du 



^ 0 


(224) 


and tt —♦ 0 
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OB t unifomly in d cmd 4t. In just the same way it is found that 
(21-2) applies to the mcterior of the bounded region r provided (22-6) holds 
with A: 0. 


Equation (22-6) with k 0 is sometimes called the DirichUt condition. It is the 
same as (21-8), hence means that there are no charges at infinity. Equation (22<^) 
with ^ ^ 0 is called the Sommerfeld radiation condition; it means that there are no 
sources of radiation at infinity. 

Although (22-6) is the form usually given, it is unnecessarily restrictive. A more 
careful analysis of (22-5) shows that (22-6) may be replaced by the weaker condition 


( du u\ u 

— — iku + “) 0 and “ (22-7) 

which reduces to (21-9) when A; -« 0. 

23. The Functions of Green and Neumann. The Laplace equation can 
be obtained by setting p = 0 in Poisson’s equation (21-1) or by setting 
/c = 0 in the Helmholtz equation (22-1). The corresponding integral 
formulas, (21-2) and (22-4), both reduce to 


u{P) 



d 1 \ 

u -) da. 

dn r/ 


(23-1) 


This expresses every harmonic function in r as an integral involving the 
boundary values and the boundary values of the normal derivative. Ho'W- 
ever, a harmonic function is determined by the boundary values cdone^ 
without any reference to the normal derivative.^ We shall now obtain a 
formula, similar to (23-1), in which du/dn is not present. 

Such a formula can be found by an appropriate choice of v in (20-9). 
Since V^u =« 0, the volume integral in (20-9) will drop out if 

vV = 0 throughout r. (23-2) 

And the tenn involving du/dn in (20-9) will drop out if t; « 0 on or. By 
(20-5), that condition is equivalent to 

to as-on <r. (23-3) 

r 

Evidently, (23-2) and (23-3) determine w uniquely. Since the function 
r « r(P,Q) involves the fixed point P, the boundary condition (23-3) 
makes w, and hence v, depend on P. The function v obtained in this way 
is called Green's function and is denoted by Cr(P,Q). Thus, 

1 

G(PfQ) =* i; t* m -f - 

r 


(23-4) 
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where r » r(P|Q) and where w satisfies (23r2) and (23^). 
(20^9) now yields 


u{P) 


1 f Bv If 

— / u — dtr »- I u 

4ir Bn A** 'a 


BQ 


iv Bn 


dir, 


The formula 
( 23 ^) 


with G given by (23-4). The differentiation and integration in (23-5) are 
with respect to Q. 

What we have shown is the following: Let u satisfy 

V^u « 0 in r, u » / on or, (23-6) 

If the region t has the Green function G, then 

1 r dG 

u(P) «-/ /(Q) —da, (23-7) 

4ir dn 

When a continuous function / is given in advance, it can be shown, con¬ 
versely, that the function u in (23-7) satisfies (23-6). In other words, 
formula (23-7) solves the Dirichlet problem. The general Dirichlet problem 
is thus reduced to the special Dirichlet problems ‘ that have to be solved 
in constructing Green’s function. 

To interpret Green’s function physically, let a unit charge be placed 
at the point P interior to a closed, grounded conducting surface a. Since 

P is the only charge present, the potential 
has the form t? — t/; -f 1/r, where «= 0. 
Since the conductor cr is a grounded equi- 
potential, v « 0 on cr, and hence, v agrees 
with the V in the foregoing paragraph. 
Thus, G(P,Q) is the potential at Q due to a 
unit charge at P in the grounded conducting, 
surface a. Because of this interpretatior 
the existence of Green’s function is very 
plausible on physical grounds.® 

The physical interpretation not only suggest 
that 0{P,Q) exists but gives a method of findini 
it in many cases. As an illustration, we shal 
construct Green’s function for the half space z > i 
(Fig. 30). Let a charge g 1 be placed at P and a charge g • —1 at Pi, the mirro 
image of P in the plane * • 0. By symmetry the potential v 0 when « « 0, an 
hence p is Greenes function. If r is the distance from P to Q and ri the distanc 
from Pi to Q, the potential is 

^ One problem for each choice of P. 

* A proof of the existence for all regions likely to be met in practice is given in 0.1 
Kellogg, 'Toundations of Potential Theory/* chap. 11, Springer-Verlag OHG, Berlii 
1929. 
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o(p,Q) -1 - i. 

T n 

As in the deriTation of (21-S), 


dn 


2 cos ^ 


on s 0, 


where ^ is the angle between PQ and the nornml to * 0. Substituting into (23-7) 

yields the Poisson formula for a half space^ 


u(x,y,z) 



cos ^ 

_ 


d<r 


± r r __. , 

2t [{X - x,)« + (y - yO* + **]« *''' 


This formula represents a harmonic function for « > 0, which reduces to f(x,y) when 

* *■ 0. 


In terms of heat flow, the Dirichlet problem is to compute the steady- 
state temperature in a solid when the temperature on the surface is known* 
Sometimes the rate of heat flow across the surface is prescribed rather than 
the temperature. The problem which arises in this way is called the 
Neumann problem; it leads to the equations 


« 0 in r, 


du 

— =» g on o. 
dn 


(23-8) 


If (23-8) is to have a solution, we must restrict g so that the rate of 
flow into T equals the rate out; otherwise, a steady-state temperature 
cannot be exi)ected. It is clear physically that the appropriate condition 
is 

[gdo-^O (23-9) 

J<r 

and indeed, the choice v » 1 in (29-3) shows that (23-9) follows from 
(23-8). 

When g satisfies (23-9), the problem is still not well posed because it has 
infinitely many solutions. That is, (23-8) involves the derivatives only, 
so that u can be altered by an additive constant. To make the solution 
unique we require that 

f^uda^Q. (23-10) 

Properly stated, the Neumann problem is to solve (23-8) when (23-9) and 
(23-10) hold. 

By means of (20-9) we can develop a Neumann function NiP^Q) ana¬ 
logous to the Green function G(P,Q) of tlie foregoing paragraphs. As 
before the condition 


« 0 throughout r 


(23-11) 
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makes the volume integral (20-9) drop out. To get rid of the surface 
integral involving w, we require that dv/dn be constant ^ on or, and we 
recall (23-10). Since i; « w -f 1/r, this requirement is 


dw d 1 

(23-12) 

— as-f. const. 

dn dn r 

To make w unique we require also that 


/ w da ^ 0. 

-ftr 

The Neumann function is 

N(P,Q) s t) = -f - 

(23-13) 

(23-14) 


r 


where w satisfies (23-11) to (23-13). The solution u of (23-8) can evidently 
be expressed in the form 

u{F) « — f / g(Q)Ar(P,Q) da. (23-15) 

47r 4ir 

When g is given in advance, it can be sho\Mi, conversely, that the function 
(23-15) satisfies (23-8), Hence, if we solve the particular Neumann 
problems involved in the construction of ^(FjQ), we can solve the general 
Neumann problem for the region. 

Physically, the Neumann function represents the heat flow due to a source of strength 
4ir at F when the heat flows out at a uniform rate across the boundary. This shows 
that the condition 

— 0 on (23-16) 

dtl 

analogous to (23-3) cannot be required in general; when the region is bounded, (23-16) 
violates the principle of conservation of heat. For unbounded regions (23-16) is possible, 
as we see by considering the Neumann function for a half plane^ 

mP,Q)-- + ~- (23-17) 

r Ti 

It is left for the reader to verify that (23-17) satisfies (23-16) on the plane s « 0 and 
to solve the Neumann problem. 


ELLIPTIC, PARABOUC, AND HYPERBOLIC EQUATIONS 

24« Classification and Uniqueness. If a, 6, and c are real continuous 
functions of x and y, and if H is a continuous function of the indicated 
arguments, the partial differential equation 

d" 2bZxy *4” H {x^yfZ^ZxjZj^ 

> See remarks at the end rf this sectlom, 


(24-1) 
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includee many equations of mathematical physics. It is oonvenient to 
classify equations of the type (24-1) according to the sign of the diBcriminmJL, 
b® — oc. When 6^ — oc < 0, the equation is said to be elliptic; when 
6^ — oc » 0, it is parabolic; and when — oc > 0, it is hyperbolic. This 
nomenclature is suggested by analogy with the conic 

ax^ -f 2bxy + cy^ ^ //, a, b, c, H const, 

which is an ellipse, a parabola, or a hyperbola according to the sign of 
b^ — oc. 

As typical illustrations, the reader can verify that 

^'xx "t* Uyy ** 0, tix* ~ 'liy, ““ '^uy ~ 0 (24-2) 

are elliptic, parabolic, and hyperbolic equations, respectively. The first 
of these is Laplace’s equation; the second ^ is the equation for heat flow; 
the third ^ describes the motion of waves on a string. The general equation 
(24-1) in the elliptic, parabolic, or hyperbolic case has much in common 
with the corresponding Eq. (24-2), and that is the reason why the classi¬ 
fication is important. We shall now discuss the conditions for unique 
determination of u. 

Case I. Elliptic Equation. Physical considerations suggest that a 
solution of Laplace’s equation is wholly determined by its boundary values. 
That Ls, if U\ and U2 satisfy 

Uxx “f" Uyy =* 0 (24-3) 

in a bounded region r, and if Wi = U 2 on the boundary, then Ui « U 2 in 
r. A mathematical proof is readily given, assuming that the function 
u = — 1^2 is continuous in r and on the boundary. 

Without loss of generality, let the region r lie between the lines x * 0 and x «* 1 
(so that cos X 9^ 0 in r). If r is defined by 

a « t> cos X, 

a short calculation shows that (24-3) yields 

Vxx 4* 2vx tan x — v >• 0. (24-4) 

Suppose that v > 0 at an interior point Pq. Then v assumes a positive maximum at 
an interior point Pi (since r 0 on the boundary and v is continuous). At Pi we have 

i; > 0, Vx * 0, V** < 0, Vyy <0, 

and hence (24*4) cannot hold. This contradiction shows that c < 0 throughout Uie 
region. Similarly, v > 0, and hence r « 0. It follows that m » 0 , as was to be proved. 
The same method can be used in three dimensions; the only change is that (244) 

* Let y in (9-5). 

»Let y "li at in (24). 
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bta &131 axtora term v««. A decidedly less elementary proof (which appliee also to the 
Ketimaxm problem) was given in See. 21» Prob« 2. 

What we have actually shown is that if the harmonic function u satisfies 
w < 0 on the boundary, then the same inequality holds at interior points. 
Considering u — m instead of u yields the following significant result:^ 
Maximum Principle. Let u be harmonic in a hounded region r and let 
m be constant. If u <m throughout the boundary of r, then u < m through-- 
out r. 

This theorem is true for the general equation * (24-1), provided (24-1) 
is elliptic and H >0, 

Case II. Parabolic Equation, Let u be the temperature of a thin rod 
extending from x « 0 to x » Z. With y « aH the equation of heat conduc¬ 
tion is 

Uzx ^ 0 < X < Z, 0 <y < CO. (24-5) 

As typical initial and boundary conditions, we assume that 

u(xfi) mf(x), w(0,y) « g(y), u(l,y) « h{y), (24-6) 

These conditions give the initial temperature and the temperature of the 
two ends. In the zy plane, (24-6) specifies the value 
of u on the boundary of a certain semi-infinite rec¬ 
tangle (Fig. 31). The physical interpretation sug¬ 
gests that u is thereby determined within the 
rectangle, and we shall now show that this is, in 
general, the case. 

Let ui and satisfy (24-5) and (24-6). The function 

U « Ui — U2 

then satisfies (24-5) and (24-6) with f q h ^ 0. For 
simplicity, we shaU suppose that u is continuous and bounded 
in the region of Fig, 31 and on its boundary, though these 
conditions could be weakened. 

If s is defined by m « ve*', substitution in (24-5) yields 

+ V. (24-7) 

Suppose that v ■■ > 0 at some point Pq of the rectangle in Fig. 31. We know that 

e « 0 on the three sides of this rectangle, and since u is bounded, the equation v * 
shows that e < oo if V is large enough. It follows that p assumes a positive maximum 
at some interior point Pi. At Fi, 

< 0 , » > 0 

and hence (24-7) cannot hold. This contradiction shows that p ^0, everywhere. 
Similarly, » ^ 0, and hence r * 0. It follows that ui m u%, 

»a. Sec. 21, Prob. 3. 

*8ee: H. Bateman, ^Tariial Differential Equations of Mathematical PhyMcs," 
p. 135, Cambridge University Press, London, 1932. 
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Th« xnethod of proving this uniqueness theorem le&ds, jti»t ne in the foregoing disoueatoni, 
to a MAJCmini Princifub: If u^m on the houndarif of the redangle in Fig. 31, then 
u ^ m throughout the rectangle. A physical interpretation is readily given. 

PRomj:MS 

1. For what values of the constant k is + ku^ -f *■ 0 eUiptio? Parabolic? 
Hyperbolic? 

2. In what regions of the xg plane is 

(1 -f y)w»* 4- 2xugy 4 (1 - y)ut^ - «« 

elliptic? Parabolic? Hyperbolic? 

8 . Show that the solution of the elliptic equation 

ti» 4 Wyy » —few 

is not always uniquely determined by the boundary values. Hint: Let the region be 
the square 0 ^x<ir, 0<y<v, and separate variables. For a physical interpretation, 
see Sec. 14. 

4 . A characteristic value for a region r is a constant X such that the problem 
Uxx 4 Upy 4 Um 4 Xu « 0 in r, u -« 0 on the boundary 

has a solution other than the tj'ivial solution u » 0. Show that a characteristic value 
is always positive. Hint: If u 0, then u has a positive maximum or a negative mini> 
mum at some interior point P. 

6 . The semi-infinite strip 0<x<ir, y>0 has its edges kept at the constant tem¬ 
perature u — 0, whereas its end j/ « 0 is kept at the temperature u * sin x. In the 
steady state the temperature u satisfies u** 4 * 0 , and also 

u( 0 ,i/) 0 , u(yr,y) « 0 , m(x,0) *» sin x. 

(a) By the method of separating variables obtain infinitely many distinct solutions to 

this problem. ( 6 ) Show that only one of these solutions satisfies lira u(x,y) -» 0 uni- 

» 

formly in x, (r) If condition ( 6 ) is imposed, show that the problem has, in fact, only 
one solution. Hint: Use the maximum principle. 

26. Further Discussion of Uniqueness. Continuing the study of (24-1) 
we consider the hyperbolic equation 

Uxx (26-1) 

Since solutions of (25-1) do not satisfy the maximum principle, the fore¬ 
going methods cannot be used here. 

Case Ilia. Hyperbolic Equaiion^ First Problem. Let the value and 
normal derivative of u in (25-1) be given on an interval (a,b) of the x 
axis (Hg. 32). Thus, 

w(x,0) «= /(x), Wy(x,0) « g{x), a <z <b. 

If G(x) « f g{s) dsj d^Alembert^s formula (3-15) yields an expression 

Ja 

2u{z,y) = fix + y) + G(x + y) + fix — y) — Gix - y), (25-2) 
which will now be used to discuss the uniqueness of u. 
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By hypothesis, f(x) wad G(x) are determined tor a < x <b but not 
outside this interval. Hence 

fix + J/) 4* 0(x + y) is determined for a < a; + y < 6, 

(2<5"3) 

fix — v) ““ y) ^8 determined for a < x — y < 6, 

but not elsewhere. In the xy plane, the loci 

a <x + y <b, a <x-y <b 
represent two strips, bounded by the two pairs of lines 
x + j/«sa, X + y and x -- y ^ a, x -- y ^ b (25-4) 

(Fig, 32). Both expressions (25-3), and hence u in (25-2), are uniquely 
determined in the intersection of these strips, but only there. This shows 
that the region of determinacy is the doubly shaded region in the figure.' 



Similar behavior is found for the general hyperbolic equation, the role of the lines 
(254) being taken by the charactenatics introduced in Sec. 29. It is often possible to 
express u by an integral formula involving the initial value and normal derivative. 
The method requires construction of the Riemann which is in some resjxjcts 

analogous to the Green function of Sec, 23. 

Case III6. Hyperbolic Equation^ Second Problem, The equation 
Ux£ “ Uyy has the general solution 

u(x,y) ^ fiix -y) + f 2 ix + y) (26-5) 

as was shown in Sec. 1. If u is given on two adjacent sides of the rectangle 
in Fig. 33, we shall use (25-5) to show that u is determined in the whole 
rectangle.* 

* Of, Theorem I, Sec. 4. 

^ ^See A. G, Webster, “Partial Differential Equations of Mathematical Physics/' 
p, 248, Teubner Verlagsgeseilschaft, Leipzig, 1927. 

• Cf. Theorem II, ^c. 4. 
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Choose a point B in the rectangle, and draw RQ and RS parall^ to the 
sides of the rectangle, as in the figure. With P the apex of the rectangle, 
r — 1 / is constant on PQ and x — ^ is also constant on SR. Hence the 
same is true of fi{x — y): 

fii^ — y) « a at P and Q, fi(x — y) » ^ at P and S. 
Similarly, 

f2{x + 2/) *= 7 at P and S, f^ix + y) « 5 at Q and P. 

By using these values in (25-5) we can verify the identity 

u{R) « n(P) + u{S) — u(P). (26-fi) 

This shows that u is determined by the data at every point in the rectangle 
and at no point outside the rectangle. 

By a procedure known as Picard^s method, the problem just discussed 
can oft.en be solved for the general hyperbolic equation (24-1). It is 
supposed that the equation has the form ^ 

Uxy == H{x,y,u,UsyUy) (25-7) 

and for simplicity we assume the homogeneous boundary conditions 

u{xfi) = 0, u(0,v) « 0, 0 < X < a, 0 < 2 ^’ < 5. (25-8) 

Thus, u = 0 on two adjacent sides of the rectangle in Fig. 34. The con¬ 
ditions (25-8) enable us to write (25-7) in 
the form 

u(r,y) j^^H{xuy\,u,Ux,Uy) dxi dyi 

where the arguments of w, it*, and Uy in the 
integral are Xi and yi, 

Picard’s method consists of choosing a first 
approximation evaluating the integral, 

and using the result as the second approxi¬ 
mation A similar process yields 

and so on. If is the nth approximation, 
the next approximation is 

f rdxidyi. (25-9) 

i/O •'O 

Subject to mild restrictions on H it can be shown ® that the solution is 
‘ Sec Sec, 29, Case HI. 

*R. Courant and D. Hilbert, **Methoden der mathematischen Physik,” voL 11, 
p. 317, I. Springer-Verlag OHG, Berlin, 1937. 




510 

givQD Iqr 


PAiRiAL mrrsREimAL EquAxioira 


{chap. 0 


By (26-9). 


«(*.») " lira 

As an illustration, let the equation be 

Usy •‘I +u, 

^(n-hi) * ^ ^ |^(»)] ^ xy -i- r rdxi dyi, 

Jo Jo Jo Jo 

Starting with •• 0, we get •• xy, *■ ary + {xy)^/4, 

«<« - xy +J^j^ [*m + ^^1^] dx.%, _ ^ 

and so on. Evidently, the process gives 

uix,y) 

That this is a solution can be verified by actual substitution. 

26, The Associated Difference Equations. Let A be a positive number 
and u = uix^y) a function of x and y. The difference operators Ax and Ap 
are defined by 

u(x -hh.y)- u{x,y) u(z, y + h) - u{x,y) 


(xy)’ 

(3!)* 


^(xy)" 

,.i («!)’■ 


AxU 


(25-1) 


Passing to the limit as A 0, we get the partial derivatives; that is, 

lim AxU Uxf lim ApU «= Up (26-2) 

0 h-* 0 

when the limits exist. If the second differences are defined by 
uix + A, y) - 2u(x,y) + u{x - A, y) 


AxzU *= 


A" 


Attt/u 


u{x, y + h) - 2u{x,y) + w(x, y - h) 


it can be shown, in general, that 
lim AxxU * Ux 




lim AyyU »* Uyp 

h^O 


(26-3) 


(see Prob, 1). Hence the three difference equations 

Ajtxti —" AypU *® 0, Axxd Ayti, Axx"^ "t** AypU *** 0 


as A —♦ 0 become the respective differential equations 

n'xz 'Wyy =* 0, Uxx ~ 0* 

The correspondence o£ difference equations and differential equations is important 
because there are numerical methods of solving the former which are especially adapted 
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to eomputers (cf. Chap. 9, Sec. 19). As -♦ 0, the solution of the dif¬ 

ference equstion generally tends to the solution of the corresponding differential equa¬ 
tion. This fact gives a means of numerical approximation which has been extensively 
exploited. Because of space UmitationB we shall consider merely the determinacy of 
the solutions, our objective being to clarify further the distinction among elliptic, 
parabolic, and hyperbolic equations. 

Case I. Elliptic EqucUim. Using (26-3) the reader can verify that 

-|- AyyU «=» 0 (26-4) 

can be written in the form 

uix^y) » + h,y) + u(x — A, y) + u(x, y + h) + u(x, y — k)l 

(26-5) 

This equation gives a relation between the five values of u at the five 
neighboring lattice points ‘ illustrated in Fig. 35; in fact, the value at ike 
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central lattice point is the arithmetic mean of the values at the four neighbors. 
The corresponding property for Laplace's equation is the average-value 
theorem given in the example of Sec. 21, 

To state the Dirichlet problem for the difference equation (26-4) we 
say that a point is interior to a region if its four neighbors are points of 
the region. A boundary point is a point for which at least one neighbor 
belongs to the region and at least one does not. For instance the points 
• in Fig. 36 are boundary points. In the Dirichlet problem a function u 
satisfying (26-4) is given at every boundary point, and it is required to 
find u at the interior points. We shall now establish both the existence 
and the uniqueness of the solution.* 

Suppose, then, that u is known at every lattice point bounding a given 
region (Fig. 36). If we write down the equation (26-5) for each interior 

* That is, points of the form {mh^nh) with integers m and n. 

* The region is assumed bounded, so that the number of interior points is a finite 
number fi. 
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poiat we obtain a system of n linear equations in n unknowns* It 
will be seen presently that the determinant of this system is not zero, and 
hence there is one, and only one, solution. On the other hand, if the values 
are not prescribed at every boundary point, there are always more un¬ 
knowns than equations, and the solution is not determined uniquely* These 
properties are analogous to those obtained previously for the Laplace 
equation. 

To show that the determinant is not zero, we shall analyze the special case in which 
the boundary values are zero. In this case the system of linear equations obtained by 
w'riting (26-4) at each interior point is homogeneous. If the ddermimnl is zero^ the 
system wiD have a solution other than the trivial solution u « 0. Without loss of 
generality we can suppose that this nontrivial solution u is positive at some point. 

Let the maximum value of u over all the lattice points be denoted by wi >0, and 
let P bo a point where u * w. Evidently P cannot be on the boundary, since w > 0 
at P. Hence, P is interior, and the value of « at P is the average of the values at the 
four neighbors. Now u < m at these neighlxyrs, since m is the maximum. If u < w at 
any neighbor, then tVie average is <m^ so that u{P) < m. This contradiction shows 
that w w m at the four neighbors of P. 

We can now repeat the process, starting with one of these" four neighbors instead of P. 
Proceeding in this way we find that a * m at every lattice point. But that is impossible, 
since u *» 0 m on the boundary. Hence the assumption that the determinant was 
zero led to a contradiction. 


PROBLEMS 

1 . (a) Show that Axxu(x,y) « h, y)]. (h) If u has a Taylor series expan¬ 
sion about the point show that w*, as /i 0. Hint: Use the first six 

terms of u{x 3/ -f ^ a bh ■+• cA -f • • •. 

2. Suppose + AyyW » 0, and suppose u is known for x « 0 and for x ^ h 
(y » h, 2h, 3 / 1 , ., In what region of the ry plane is u determined? I/irit- See Fig. 35. 

3. Let AxxU + Aifj/U =* 0, and suppose u(0,y) «« 1 for all y, u(2h,y) «= 2 for all ?/, 
w(Ai,0) «« u(h,4h) « 0. Find u(h,2h). 

27. Further Discussion of Difference Equations, According to the fore¬ 
going discussion, the elliptic ca.se l(\ads to a set of simultaneous eejuations 
for determination of the unknown function u. In the parabolic and hy¬ 
perbolic cases, as we shall now see, the values of u can be obtained sut;- 
cessively. 

Case II. Parabolic Equation, By (26-1) and (26-3) the equation 

Axxti »= AyU (27-J) 

takes the form 

u{x + h,y) - {2 - h) u{x,y) -f u{x - h,y) ^ hu{x, y + h), (27-2) 

This shows that if is known at the three collinear points in Fig. 37, then 
u can be found at the fourth point. 

By analogy with the problem of heat flow discussed in Sec. 24, let u 
be given at the points • on the boundary of the semi-infinite rectangle 
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in Fig. 38. Referring to Fig. 37, we see that u can be found at tlie latrioe 
points with y *= A, in the rectangle. Repetition gives m for y ■» 2h, and 
so on. Thus, u is determined throughout the rectangle, just as in the case 
of Fig. 31. The process works equ^y well when the rod is infinite and 
u is given at all the lattice points on the x axis. 
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Because the pattern of Fig. 37 points upward, so to speak, it is impossible to proceed 
in the negative y direction when the rod is infinite. The very first step leads to a system 
of infinitely many equations in infinitely many unknowns. Inasmuch as y « where 
t is time, this fact expresses the irreversibility of thermodynamic processes. 

Further insight into the one-directional character of i is given by (9-10) and ^18-4). 
In general, these expressions are infinitely differentiable for f > 0 but divergent for 
t < 0. This behavior of the heat equation contrasts to that of the wave equation. As 
we have repeatedly observed and will see again in the sequel, the latter is meaningful 
for negative t. 

Case Ilia, Hyperbolic Equation^ First Problem. Writing the equation 


AxxU ~ AyyU = 0 


(27^) 


in the form 

u{x + h,y) + u(x - h,y) - u(x, y + h) - u(x, y - h) ^ 0 (27-4) 

we see that the corresponding pattern is tliat 
shown in Fig. 39. If u is given at any three 
of the four lattice points, then (27-4) gives u 
at the fourth point. Inasmuch as the pattern 
is symmetric, one can proceed in the positive 
y direction and in the negative y direction 
with equal ease. 

To discuss the analogue of the initial-value 
problem (Sec. 25, Case IIIo), let u and A^u be 
given in an interval of lattice points on the x 

axis. This is equivalent to specifying w itself on two adjacent rows of lattice 
points, as indicated by the black dots in Fig. 40. Considering Fig, 39 in 
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coBjunetioH with Fig. 40, we see that the region of determination for u 
consists of the lattice pmnts in the square. The analogy with Fig. 32 is 
evident. 

Case IIB. Hyperbolic Equation^ Second Problem, If w is given on two 
adjacent sides of a rectangle as shown in Fig. 41, we can apply Fig. 39, 



starting at P. It will be found that u is determined at the indicated 
points O, and at no others. This behavior corresponds to that found in 
Sec. 25, Case Illfc. 

PROBLEMS 

1. In Fig, 38 let w » 0 on the vertical rows of points •, and w »■ 1 on the horizontal 
row •, for 0 < X < 1. Assuming «« 1 in (27-2) find u{hfih). 

%, In Fig, 40 let w « 1,0,0,2,0,0,3,0,0 on the bottom row of points • (in order), and 
let ■« 0 at these points. Find n(3A,5^), 

8 . Let u(P) «* 0 in Fig. 41. Find the value of u at the opposite comer if u 0 at 
the points • on the left of P, and u » 2 at the points • on the right of P. 

28. An Example: Flow of Electricity in a Cable. Many physical prob¬ 
lems lead to an equation that changes type, 
according to the values of the physical 
parameters. Since the character of the solu¬ 
tions undergoes a corresponding change, this 
phenomenon has great practical importance. 
As an illustration we shall consider the flow 
of electricity in linear conductors (such as 
telephone wires or submarine cables) in 
which the current may leak to ground. 

Let a long, imperfectly insulated cable (Fig. 42) carry an electric current 
whose source is at A. The current is assumed to flow to the receiving end 
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at P through the load B and to return through the ground. It is assumed 
that the leaks occur along the entire length of the cable because of im¬ 
perfections in the insulating sheath. Let the distance, measured along 
the length of the cable, be denoted by x\ then the emf V (volts) and the 
current I (amperes) are functions of x and L The resistance of the cable 
will be denoted by iZ (ohms per mile), and the conductance from sheath 
to ground by G (mhos per mile). It is known that the cable acts as an 
electrostatic condenser, and the capacitance of the cable to ground per 
unit length is assumed to be C (farads per mile); the inductance per mile 
will be denoted by L (henrys per mile). 

Consider an element CD of the cable of length Ax. If the emf is V at 
C and V + AV at I), then the change in voltage across the element Ax 
is produced by the resistance and the inductance drops, so that one can 
write 

dl 


AV 


i if? Ax H- L Ax 

K dt 


) 


The negative sign signifies that the voltage is a decreasing function of 
X. Dividing through by Ax and passing to the limit as Ax 0 give the 
equation for the voltage: 


dV 

dx 


« -IR - L 


dt 


(28-1) 


The decrease in current, on the other hand, is due to the leakage and the 
action of the cable as a condenser. Hence, the drop in current Al across 
the element Ax of the cable is 

dV 

AI - - VGAx -C Ax, 

di 


so that 


dl 

dx 


VG - C 


BV 
dt * 


(28-2) 


Equations (28-1) and (28-2) are simultaneous partial differential equa¬ 
tions for the voltage and current. The voltage V can be eliminated from 
these equations by differentiating (28-2) with respect to x to obtain 

h. « - CF,,. 

Substituting for from (28-1) gives 

« IRG + LGIt ~ CF^, 

from which F«» can be eliminated by using the expression for F^j obtained 
by differentiating (28-1). Thus one is led to 

- LCIn - (LG + RC)U + IRQ- 


(28^) 
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A similax oalculatioa shows that (28-S) is also satisfied by the voltage V, 
Evidently, (28-3) is hyperbolic when LC 9 ^ 0 but parabolic when jDC «= 0, 
When the cable is lossless, R ^ G 0. Equation (28-3) and the cor¬ 
responding equation for V are, then, 

hx « - LCVtt. (28-4) 

Comparing the equation for wave motion (Sec. 1), we see that the cable 
propagates electromagnetic waves with velocity 

a « (LCr^. 

The hyperbolic equation (28-4) is appropriate if the frequency is high and 
the loss is low* 

For an audio-frequency submarine cable it is more appropriate to take 
(J «: i SK 0* The equations are then parabolic: 

hx - zee/,, y.. - Rcvt, (28-5) 

Instead of representing waves, the propagation of V and / is now identical 
with the flow of heat in rods. Comparing wath (9-5) gives 

a = {RCr^. 

Example: Consider a submarine cable I miles in length, and let the voltage at the 
source under steady-state conditions, be 12 volts and at the receiving end jR be 6 
volts. At a certain instant ( * 0, the receiving end is grounded, so that its i:K>tential 
IS reduced to aero, but the potential at the source is maintained at its constant value 
of 12 volts. Determine the current and voltage in the line subsequent to the ground¬ 
ing of the receiving end. 

It is required to find V in (28^) subject to the boundary conditions 

F(0,0 « 12, FftO « 0, e > 0. (28-6) 

The initial condition is 

y(i,0)-12-6j (28-7) 

since the steady-state solution of (28-5) m a linear function of x (Sec. 9, Example 1). 

The voltage F(x,0 subsequent to the grounding can be thought of as being made up 
of a steady-state ^ voltage Vsix) and a transient voltage Fr(x,0 which decreases rapidly 
with time. Thus, 

F(x,0 - Fs(x) + Fr(x,0. (28-8) 

Since Vsix) is linear, its value is given by the boundary conditions as 

rs(z)- 12 ~ 12 j- (28-9) 

Equations (28-6) and (28-7) now yield 

, Fr{ 0 ,t) - Vt(IA - 0 , rr(*. 0 ) - j- 


1 Compare Sec. 9, Eicample 2. 
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Sino6 Vt satisfies (2S-fi), w« can tise the solution of the heat equation (9^10) with 6? m 
1/jRC, The result is 

TrfeO ** £ f7 f 7 351 shi e^C/acXttir/o*! gjjj !!!^. 

n»»l \t Jo i I / I 

The function V is now given by (28-8). 

PROBLEMS 

1. By using (28-1) with L 0, find / in the Example. 

2. Find the ©mf in the cable whose length is 100 miles and a^hose characteristics are 
as follows: R « 0.3 ohm per mile, C « 0.08 nf per mile, L « 0, fi? « 0, If the voltage 
at the source Is 6 volts and at the terminal end 2 volts, what is the voltage after the 
terminal end has been suddenly grounded? (Use (28-5).j 

8. Using (28-5), find the current in a cable 1,000 miles long, whose potential at the 
sourc^j, under steady-state conditions, is 1,200 volts and at the terminal end is 1,100 
volts. What is the current in the cable after the terminal end lias been sud¬ 
denly grounded? Use i? « 2 ohms per mile and C »» 3 10"’^ farad per mile. 

29. Characteristics and Canonical Form. If a, b, c are continuous func¬ 
tions of X and with a 5 *^ 0, then the partial differential equation 

auxx + 2 buxy + cuyp = H(x^y,u,Ux,Uy) (29-1) 

can be simplified by use of the equation 

a dy^ — 26 dy dx + cdx^ ^ 0 . (29-2) 

Setting dy = pdx in (29-2) and solving the resulting quadratic give 

p = ^ or p — a'“n 6 — ( 6 ^ — ac)^]. (29-3) 

Since p = dy/dxj Eqs. (29-3) are ordinary differential equations of the 
first order, and hence the solutions may be expected to contain an arbitrary 
constant c. If the solutions are written in tlie form 

.Y (x,y) = c or F(x,^) = c, (294) 

the resulting curves (29-4) are called the characteristics of (29-1), 

For example, when (29-1) is the wave equation 

a^Uxz — « 0 (29-5) 

the differential equation (29-2) is 

o® dt^ — « 0, 

Since this reduces to dx/dX « dza, the characteristics are the straight lines 

X— 

It was shown in Sec, 1 that the change of variable 

f X — « « a; -f at, u(a:,t) « U(r,«) 
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reduces (29^) to the form Ur^ 0» aud a physical interpretation of the charactexistios 
was given in Sec. 4. 

Equation (29-1) is said to be in canonical form if it has one of the three 
forms 

where is a function of x, u, u^y and Uy, It is a basic fact that the 
reduction to canonical form can be achieved by means of the characteristics, 
and we shall now describe * the procedure. 

Casjs I. Elliptic Equation, When b^ — ac < 0, tlie two values of p 
in (29-3) are conjugate complex, and hence the same is true of X and Y 
in (29-4). That is, 

X « r(x,y) + is{x,y), Y =« r{x,y) - is{x,y) 

where r and a are real. In this case the reduction can be achieved by choos¬ 
ing r and a as new independent variables. If u{x,y) = U{r,s), Eq. (29-1) 
gives an equation for U in which the second derivatives occur as Urr + 

Case II. Parabolic Equation. When 5^ — oc = 0 the two values of 
p in (29-8) are real and equal. Hence the same is true of X and Y in 
(29-4). In this case the reduction can be achieved by the change of vari¬ 
able 

r » X(x,y), a « any function independent of X, 

The second derivatives of U now occur as 

Case III. Hyperbolic Equation. When 6^ — ac > 0, the roots (29-3) 
are real and unequal, and the same is true of X and Y. The reduction is 
achieved by taking 

r » X(z,y), a = Y{x,y) 

as new independent variables. The second derivatives of U occur only 
as Ur$^ 

To illustrate the procedure we shall consider the equation 

Uxx kxuxy “h 4iX^Uyy ■■ 0 (29-6) 

when A: 0, 4, or 6. According to (29-3), 

p « ± - 4x*)^. (29-7) 

When fc — 0, this gives p « =b2tx. The equations db2ix have the solutions 
y -■ ix* « c, y -h tx® «» c 

* A proof may be found m A. G. Webster, “Paitial Differential Equations of Mathe¬ 
matical Physics,'' p«242, Teubner Verlagsgesellschaft, Leipzig, 1927. 
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where e is oonstaot. Taking real and imaginary parts, 

r « V, s - **. 

With tt(x,y) -« U(r^8)^ the deriyatives are 

« 4a;*r7gg -f- 2f/,, u»y » 2xUtrt ^/y ^ Urr 

and substitution into (29-6) with A; -» 0 gives the canonical form 

Urr + U., - -(2x*)-iC7. « H2sr^U,. 

Wlien fc «» 4, the two roots (29-7) are both p - -“2a;. Solving this differential equa¬ 
tion we see that (29-4) is 

y -f X* *« c, y -f X* — c. 

Since y -f x* and y are independent, we can take 

r « y *f X*, « « y. 

It is left for the reader to show that the canonical form is 


U,, « -(2x^r^Ur « ~(2r - 2$rWr, 


Finally, the case A « 6 leads to two distinct real roots p 
p « dy/dx and solving, 

V + Hr’ “ C, y + 2z* - c. 


The change of variable 

r «« y 4- « « y 4- 2x* 

now leads to the canonical form 

Vr, « (6« - err\Ur 4“ 4C/.). 


P 


(29-S) 
—4x. Setting 


(29-9; 


PROBLEMS 


1. Derive (29-8) and (29-9). 

2. Describe the behavior of the characteristics of (28-3) as LC varies from uero to 
infinity. 

8. Reduce to canonical form 

3ttay « U*, 4" 2Uyy, 2Uxy « 4- UyVf * W*X *f 2\iyy, 

30. Characteristics and Discontinuities. The function u ^ f{x -- at) 
represents a wave propagating in the positive z direction with velocity a. 
If /(x) has the form shown ^ in Fig. 43, the motion exhibits a w<we front 
(Fig. 44), whose locus can be found by setting the argument off equal to c: 

X -- af c. 

In the xt plane, we recognize that this equation describes a characteristic 
of the wave equation (29-5). 

> The intent is f(x) - 0 for x > c, f(c) « 0. /'(c-h) - /"(c-) 3^ 0. 
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Discontinuities of the type just considered arise in many investigations, 
ranging from the theory of the cracking of glass to the theory of super¬ 
sonic flight. The locus of the discontinuity is always a characteristic, as 
we shall presently see, and hence, the foregoing example is typical of the 
general case. 



c » 

Fig. 43 



Since the locus is a characteristic, a discontinuity of the type in question 
may arise on two families of curves for hyperbolic equations, it may arise 
on one family for parabolic equations, and it cannot arise for elliptic equa¬ 
tions. For example, the equations of fluid flow are elliptic at velocities 
less than tlie velocity of sound in the fluid. But at velocities exceeding 
the velo<*ity of sound the equations become hyperbolic, and the fact that 
a discontinuity is now possible permits the formation of a shoc/c wave. 

To discuss these questions mathematically, consider a solution surface 
u «= u(x,y) satisfying (29-1). We suppose that v is continuous and has 
continuous first derivatives but has a discontinuity in one of the second 
derivatives on a certain curve C. This single solution is to be regarded as 
two solutions wliich are tangent along the curve C hut 

do not have equal second derivatives along C. We let the surfaces de¬ 
fined by Ui and extend past C, so that their first derivatives are well 
defined on C. 

The symbol ( ) denotes the jump of the function in the parentheses; 
that is, 

(w) — ui — U 2 evaluated on C. (30-1) 


Differentiating (30-1) with respect to x gives 

(Wx) - Uu - tl2x = (u)x (30-2) 

and similarly for other derivatives. Our hypothesis is that 


(w) — (Wx) 0 


(30-3) 


but that one of the quantities (u*v), or (uy^) is not zero. 

Differentiating the relation (w*) *= 0 with respect to z by the chain 
rule yields 

(Uxx) + {u^y)y' = 0 (30-4) 

when we recall that y is function of x on C and take due account of 
(30-2). Similarly, differentiating (uy) = 0 with respect to x yields 


iUyx) + (Uyy)y' ^ 0 . 


( 30 - 5 ) 
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Taking the ( ) (rf the partial differential equation (29-*l), we get 

aiUxz) + 2b(Uxy) + C(Uyy) ^(11)^0 (30-6) 

when H is continuous. The fact that (H) ^ 0 follows from (30-3) if we 
recall that H does not involve the higher derivatives of u. 

Equations (30-4), (30-5), and (30-0) are three linear homogeneous 
equations in the three unknowns (w**), (uxy) == (uyx)^ and (uyy). By 
hypothesis not all these unknowns are zero, and hence, the coefficient 
determinant must vanish: 


1 V' 0 

0 1 y' 

a 2b c 


0 . 


(30-7) 


Expansion of this determinant yields the characteristic equation (29-2), 
so that C must be a characteristic curve. Conversely, if C is a character¬ 
istic, the determinant (30-7) is zero, the related homogeneous equations 
have a nontrivial solution, and a discontinuity is possible. 

Example. Fundamental Soltdiona. A fundamental solution of a partial differential 
equation is a solution of the form /lX(x,y)], when X is a Jixed function and / an arbitrary 
function. For example, the equation u^x bas the fundamental solutions 

fi(x - y) and / 2 (x + y) {30S) 

in wliich X « x — y and X » x -f y, respectively. We shall now see that if (29-1) has 
the fundamental solution /[X(x,y)l, then the curves 

X(x,y) » r, c const (30-9) 

are characteristics. For proof, it suffices to choose the arbitrary function / so that f\x) 
is continuous except at x » r. Then the function u « /lX(x,y)] has a discontinuity of 
the type previously considered on the locus X(x,y) « c, and the desired result follows. 

This explains why the technique's used in St'cs. I to 6 to study the wave equation are 
not applicable to Laplace’s equation. Namely, d'Alembert’s method is based on the 
fundamental solutions (30-8) and the Laplace equation, l)eing elliptic, has no such 
solutions. 
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This chapter contains a concise presentation of the rudiments of com¬ 
plex-variable theory with an indication of its many uses in the solution of 
important problems of physics and engineering. This theory, with roots 
in potential theory and hydrodynamics, is among the most fertile and 
beautiful of mathematical creations. Its unfolding left a deep imprint on 
the whole of mathematics and on several branches of mathematical physics. 
To an applied mathematician this theory is a veritable mine of effective 
tools for the solution of important problems in heat conduction, elasticity, 
hydrodynamics, and the flow of electric currents. 


ANALYTIC ASPECTS 

1. Complex Numbers. The analysis in the preceding chapters was 
concerned principally with functions of real variables, that is, such vari¬ 
ables as can be represented graphically by points on a number axis, say 
the X axis of the cartesian coordinate system. The reader is familiar with 
the iact that calculation of the zeros of the function f(x) = ax^ 4 -f c, 
when the discriminant — 4ac is negative, necessitates the introduction 
of complex numbers of the form u + iv, 
where u and v are real numbers and i is a 
number such that * —1, 

A number of the form u + iv can be repre¬ 
sented by a point in a plane referred to a pair 
of orthogonal x and y axes if it is agreed that 
the number u represents the abscissa and v 
the ordinate of the point (Fig. 1). No con¬ 
fusion is likely to arise if the point (u,r), asso¬ 
ciated with the number u -b iv, is labeled 
simply u + iv. It is clear that the point {u,v) can be located by the 
terminus of a vector z whose origin is at the origin 0 of the coordinate 
system. In this manner a one-to-one correspondence is established 
between the totality of vectors in the xy plane and the complex numbers. 
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The vector z may be thought to represeut the resultant of two vectors, 
one of which is of magnitude u and directed along the x axis and the other 
of magnitude v and directed along the y axis. Thus, 

z u + iv, 

where u is spoken of as the real part of the complex number z and v as 
the imaginary part. Therefore, if the points of the plane are referred to 
a pair of coordinate axes, one can establish a correspondence between the 
pair of real numbers (u,v) and a single complex number u + iv. In this 
case the xy plane is called the plane of a complex variable, the x axis is 
called the real axis, and the y axis is called the imaginary axis. 

If V vanishes, then 

2 « w + O't *= u 

is a number corresponding to some point on the real axis. Accordingly, 
this mode of representation of complex numbers (due to Gauss and Argand) 
includes as a special case the usual way of representing real numbers on the 
number axis. 

The equality of two complex numbers, 

a + ib ^ c + id, 

is interpreted to be equivalent to the two equations 
a =» c and 6 « d. 

In particular, a + %b 0 is true if, and only if, a ** 0 and 6 « 0. 

If the polar coordinates of the point {u,v) (Fig. 1) are {r,B), then 

u r cos 6 and v *« r sin 0 

so that 

r » -f and B * tan“"^ 

u 

The number r is called the modulus, or absolute value, and 6 is called the 
argument, or phase angle, of the complex number z ^ u + iv. It is clear 
that the argument of a complex number is not unique, and if one writes 
it as 0 + 2kT, where 0 < ^ < 2ir and k 0, d=l, dh2, ..., then B is called 
the principal argufnent of z. The modulus of the complex number z is 
frequently denoted by using absolute-value signs, so that 

r = \z\^\u + iv\ « Vti^ + v^, 

and the argument B is denoted by the symbol 

B « arg z. 

^ The student is assumed to be familiar with the fundamental algebraic 
operations on complex numbers, and these will not be entered upon in 
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detail here. It should be recalled that (cf. Chap. 2, Sec. 15) 

zi -f 23 = (xi + m) + (si + m) “ (*i + *2) + iivi + 2/2), 

Zl-Z2 = (Xi + tVl)(X2 + iVi) = ( 2 : 1 X 2 - ViVi) + i{XiJ/2 + X2|fi), 

2 i Xi + iyi X1X2 + ViVs . *2^1 - * 1^2 

•— ass ... - .I... . .. . seat . . ..i — . r .i..... _l_ ^ . . . * " 2 

Z2 X2 + iy2 4 + ^ + 2/2 

provided that l^al * Vx| + 2/2 5*^ 0- 
On representing complex numbers zi and Z 2 ^>y vectors, we can see at 
once from Fig. 2 that they obey the familiar “parallelogram law of addition” 
formulated in Chap. 4, 



From elementary geometric considerations we deducjc that 

l^Ji + ^2l<|2i! + l^2i; (W) 

that is, the modulus of the sum of two complex numbers is less than or equal 
to the sum of the miKiuli. Tliis follows at once from Fig. 2 on recalling that 
the sum of two sides of a triangle is not less than the third side. 

Also, 

121 + ^^21 >1^11-1^21; (1-2) 

that is, the modulus of the sum is greater than or equal to the difference of the 
moduli. This follow’s from the fact that the length of one side of a triangle 
is not less than the difference of two other sides. 

Equations (1-1) and (1-2) yield a useful inequality, 

|2li-|22i<|2l -«2l<l*lH-|*2l, (1-3) 

indicated in Fig. 3. 

When calculations are carried out with complex numbers, the notion of 
the conjugate complex number is useful. We define the conjugate I of the 
number z « x -f- ty by the formula 

S “ X — ty. 
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The application of the rules for addition, multiplication, and di^dsion 
of complex numbers yields the following theorems: 

(®) + ^2 ** + ^2, (1-4) 

or, in words, the conjugate of (he sum of (wo complex numbers is eqtud to the 
mm of the conjugates; 

(b) ^ = 2il2, (1-6) 

that is, the conjugale of the product is equal to the product of the conjugates; 



or (he conjugate of the quotient is equal to the quotient of the conjugales. 

We note that if I =* z, then z is real. 

The geometric interpretation of multiplication and division of complex 
numbers follows readily from polar representation of complex numbers. 
Thus, 

Z 1 Z 2 «» ri(cos + t sin 0 i)r 2 (cos $2 + i sin ^ 2 ) 

rir 2 (cos (^1 + 62 ) + i ®in (^i + ^ 2 )]- (1**7) 


That is, the modulus of the product is equal to the product of the moduli and 
the argument of the product is equal to the sum of the arguments. 

Also, 


«2 


ri(cos<?i + i sin ^ 1 ) 
r 2 (cos $2 + i sin 62 ) 


~ — [cos (^1 ~ ^ 2 ) + i sin (^1 ~ 62 )]^ 


( 1 - 8 ) 


as follows on multiplying the numerator and denominator in (1-8) by cos $2 
t sin ^ 2 * Thus, the modulus of the quotient is the quotient of the moduli 
and the argument of (he quotient is obtained by subtracting the argument of the 
denominator from that of the numerator. 

On extending formula (1-7) to the product of n complex numbers 


we get 


Zk * rifc(cos i sin Sk), fc « 1, 2, ..., n, 


^1^2 -. * ’^1^2* • .rnicos {Bi + $2 -H ^n) + ^ sin (^1 + ^2 H-h ^n)] 

and, in particular, if all zs are equal, 

z^ as [r(cos ^ i sin 5)]" « r’*(cos -f t sin n$). (1-9) 

Formula (1-9) is known as the de Moivre formula^ BXid we have shown that 
it is valid for any positive integer n. We can i^ow that it is also vaUd 
for negative and fr^tional values of n. 
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Indeed, from (1-8) we deduce that 


1 cos 0 “h i sin 0 1 

^ * -— ; : « - [cos + i sin {-$)], 

z r(cos ^ 4 -1 sm d) r 

and since (1-9) is known to hold for positive integers n, 

zm Z'^ a* [cos ( — ^) + 1 sin (—^)]* 

» r"^[co8 (—71^) + isin (—n^)]. 


This establishes the result (1-9) for negative integers n. 

To prove the validity of (1-9) for fractional values of n, it suffices to 
show that it holds when the integer n is replaced by 1/n, for on raising 
the result to an integral power m, we obtain the desired formula for frac¬ 
tional exponents. 

Let 

W m mi ^ ^ 

so that is a solution of equation 

as z. 

On introducing polar representations, 

w =« 12(cos ^ -f i sin ^), 
z «= r(cos 0 + i sin 9), 


(WO) 

(Ml) 

(W2) 


where 0 is the principal argument of z, we can write (1-11) with the aid 
of (1-9) as 


n jR”(cos ritp + i sin ?i(p) =*= r(cos & + t sin 0 ), 
We conclude from this that 

fi” sa r, ^ ± 2 kir, A; *= 0, 1, 2, .. 


and thus 
Hence, from (1-12), 


n/~~ ^ dz 2 Ajt 

R tm wr , ^ =-- fc *= 0, 1, 2, ,. 


n 


nr ( ^ ± 2ife7r 6 ± 2kic\ 

tt; SSI V r 1 cos-h i sin ■ 

V n 


b 2 fcir\ 

T~j’ 


and on recalling (1-10), we see that 

,, ., 9 ±2kir . e± 2kir\ 

giln fr(co8 0 + i sin (cos-1 -1 sm-)• 

\ n n / 


(1-13) 
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Since cos {6 ± 2fcir)/n and sin (9 ± 2fcir)/n have the same values for 
two integers k differing by a multiple of n, the formula (1-13) yields just 
n distinct values for namely, 




fl/n 



e + 2 kr 
n 


4 - isin 


e + 2 kr\ 


fc » 0, 1, 2, ..n — 1. 

(144) 


The validity of formula (1-9) for fractional values of n follows directly 
from (1-14) upon raising to an integral power m. 

We illustrate the use of formula (1-14) by two examples. 

Example 1. Compute In this case r « 1 and its principal argument d * 0. 
Formula (1-14) then yields 

n/- 2A;ir . . 2Afir , . « 

V1 « cos-h t sin- 1 A: •* 0, 1, ...» n — 1. 

n n 


If we plot these n roots of unity, we see that they coincide with the vertices of a regular 
polygon of n sides inscribed in the unit circle, with one vertex of the polygon at z 1. 
Figure 4 shows this for n *« 6. 

Example 2, Find all roots of + i. Since 1 -h » *“ \/2 [cos (t/4) 4“ i sin (t/ 4)], 
formula (1-14) gives 






•f i sin 


(ir/4) 4- 2fcr\ 
3 / 


A: » 0, 1, 2. 


Thus the desired roots are 

wi « -C^(cos -f t sin Ha*-), 
tt>2 “ -^(cos 4*»sin K*r), 
wz - ^y 2 (co& ^Ha*^ -f tsin ^Ha*")- 
These roots are represented in Fig. 5. 
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The reader unekilled in simple calculations involving complex numbers 
is urged to work out the representative problems in the following list before 
proceeding to the next section* The symbols Re (z) and Im (z) used in 
some problems in this list denote, respectively, the real and imaginary 
parts of a complex number s. 


PROBLEMS 


1. Find the moduli and principal arguments of the following numbers, and represent 
the numbers graphically: 


(a) 1 + jS/ 3, (b) 2 + 2i, (c) -2, (d) i», (e) 


J 


(/) 


1 +t 


(b) (1 - i)‘. 

2. Write the following complex numbers in the form a + 6i; 


1 +»•’ 1 -i’ (, _ VS)’’ 


(ff) 


(a) (1 - V3 lY, (b) 


(1 + i)’ 


W- 




1 - i ''' 1 + i 

8. Find the cubes of the following numbers: 

(a) 1, (h) H(-l + iVS), (c) H(-l - fVl). 

4. Find the cube roots of t, and represent them graphically. 

6. Fin<i all solutions of the equation + I »« 0. 

6. Verify that <e^ — 22 -f 2 0 has the roots 2 *» 1 ± t. 

7. Compute and represent graphically the following numbers: 

(a) I, (6) (c) ^1, (d) ‘ 

8. Find all the fifth roots of 1 -f i, and represent them graphically. 

9. Use de Moivre’s formula |r(cos6> -f ? sm^))” » r”(cosn0 + i sin n$) to o'^iain 
c ‘08 20 «* cos^ 0 — sin^ 0 and sin 20 » 2 sin 6 cos 0. 

10. Write the following numl>ers in the form a -j- hi: 

(a) VI, (6) Vl - t, (c) ■ ^ 


Vi + »' 

11 * Prove that (a) 2i -f 22 ** ^2 *f W ziZi » ZiZ\, (c) zi{zi -f 23) 

12 . Show that if 2122 « 0, then zi * 0 or 22 “ 0 . 

18. Prove formulas (1-4), (1-6), and (1-fi). 

14. Find |z|, iJ, Re ( 2 ), and Im (z) for the following: 

(o) * - 1 - 2f: (6) * - 3 + 4i; (c) * = \-rT ’ 

3 + 4t 

16. Show that (a) tz ^ — 12 , (6) |?| =« | 2 |®. 

16. What is the locus of point<8 for which 
(a) |z| - 17 (6) |z| < 1? (c)|*|>l? 

Bint: ] 21 »» -\/ 

17. If 2 » X 4* Wt what is the locus of points for which 
(a) Re ( 2 ) > 1? (6) Im (z) > I? (c) Re (z*) « 1? 

18. If 2 « X -f Wj describe the loci: 

1 I 2 — 1 j 

(a) 1 2 — 11 «» 2; (6) — »» const; (c) —« const, 

{zi 12 + 11 

19. Under what conditions does one have the relation 

(a) Izi -h *2! « l«il *f |«2|? ( 6 ) \zi -f z%\ *• M - I22I? 

20. If * « X + iy^ write the following in the form u + iv: 

(o) Y, (6) -, (c) —5- .{d)Y + t-l,(e)~ 


*1*S + *1*». 
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2« Fonctions of a Compltx Variiible» A complex quantity z ^ x + iy 
in which x and y axe real variables is called a complex variable. We shall 
speak of the plane in which the variable z is represented as the z plane. 
If in some region of this plane for each z ^ x + iy one or more complex 
numbers to » u + w are determined, we say that to is a function of s 
and write 

w u + iv ^ f(z). 

Thus, to « ^ + i2xy « (x + iy)^ « 

is a function of z defined throughout the z plane. Also, 
w^u + iv^x—iy^i 


is a function of z. In fact, every expression of the form w(x,y) + iv(x,y) 
in which u and v are real functions of x and y is a function of 2 , since x s 
+ i) and y s (l/2i){z — z) are functions of z. 

A complex function to = f{z) is single-valued if for each 2 ; in a given region 
of the z plane there is determined only one value of to. If more than one 
value of to corresponds to Zy the function w « f(z) is multiple-i>alued. Thus 

to « — 2 ^ + i2xy « s? 

and to « — i2xy « ^ 

are single-valued functions of z. The function to « \/z for each 2 5 »^ 0 
determines two complex numbers, for on setting z *= r(cos ^ + t sin $) 
and recalling formula (1-14), we get 


so that 


to SB 

W\ *= 

t02 « 


(' 


e + 2fcir e + 2kit' 

cos-h i sin — 


)■ 


/ ^ 

I cos —I- t sm - I' 

V 2 2/ 

l^cos ^2 ^ ^ ^ 


fc = 0, 1, 


Thus to =» \/z is not single-valued. 

The fimctions in the foregoing examples are defined throughout the 
z plane. The function to = I /2 is not defined at the origin 2 « 0, while 
to w l/(| 2 l — 1) is not defined when I 2 I « 1, that is, when the points 2 
lie on the circle of radius 1 with center at the origin. 

Of course, w »« J{z) may be defined by different formulas in different 
regions of the plane, or it may not be defined at all in certain regions. 

In dealing with regions of the 2 plane we shall distinguish interior points 
from those that lie on the boundaries of the region. A characteristic prop¬ 
erty of the interior points is that about each interior point P one can draw 
a circle with center at P and with nonzero radius r so small that the circle 
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contains only those points that belong to the region. The points cm the 
boundary of the region are not interior because every circle with the 
boundary point as its center includes points that do not belong to the 
region. 

A region consisting only of interior points is said to be open. An ex¬ 
ample of such a region is the circular region w^hose points z satisfy the 
condition \z\< R. Wlien the boundary of the region is included in the 
region, the region is called closed. An example of a closed region is the 
region consisting of points z such that \z\< R. 

If every point of the region is at a finite distance from the origin, the 
region is said to be finite or bounded. Thus all points of the bounded region 
lie within a circle 1^1 - ft if the radius R is chosen sufficiently large. The 
region consisting of all points in the z plane is unhoundM, and so is the 
region consisting of the points satisfying the condition \z\> 1. 

A plane region is simply connected if every closed curve drawn in the 
region encloses only points of the region. Thus, a region bounded by an 
ellipse is a simply connected region, while a region bounded by a pair of 
concentric circles is not simply connected. A region that is not simply 
connected is called multiply connected. 


PROBLEMS 


1. Express the following functions in the form u(x,y) -f : 

{«) *^ - . + 1, (6)W (d) (e) ,(/). + - *), to) i, 

^ t + 2 

2. Describe the regions in the z plane defined by the following conditiona: 
(o) Re W < 3; (6) Im W > 1; (c) ls| > 1; (d) I < [si < 2; (c) |* - 1 [ < 1; (/) 1* - s,| 

U + tl>2. 

3. Elementary Complex Functions. In Sec. 1 we defined the operations 
of addition, multiplication, division, and root extraction for complex 
numbers. The^e suffice to determine, for any z, values of such algebraic 
expressions as 

aoz"^ + d-ho 


w = 


bo^^ + biz^ 1 . -j- 5 


in which the powers m and n may be integers or fractions. However, they 
do not provide direct means for defining the complex counterparts of the 
real elementary transcendental ^ functions c*, sin x, log x, tan“”^ x, etc. 

* A variable w satisfying the equation F(ZfW) *■ 0, where P is a polynomial in g and 
w, is called an algebraic function of z, A function that is not algebraic is called tranecei^ 
dental. The trigonometric and logarithmic functions and their inverses are called sfe- 
menlary tramcendmUd funcHonz, 
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A uaeliil of a comidex lonctioni saoh m for example, must 

^pedbdise to ^ when z assumes real values. Also, it is desirable to pre* 
serve the familiar law of exponents » en+% 

A definitive formula for e* tliat fulfills these criteria is 


e* « sa e*(co8 y + t sin y). (3-1) 

Moreover, as we shall presently see, it suggests sensible definitions for 
all the other elementary transcendental fxmctions. We note first that for 
X 0 the definition (3-1) yields 

« cos y + f sin (3-2) 

On replacing y by —y we get 


e”'**' » cos y — t sin y. 

Adding and subtracting (3-2) and (3-3) we get the Evler formulas 
cosy « 3^(e'*' + e~^), 

1 

sin j/ » — (e**' — e **'). 


(3-3) 


(3-4) 


Ihese formulas suggest that we define the trigonometric functions of z 
as follows: 


cos r «= - (e** + € *•), 
2 


smr 




sinz 

tan z --» 

cos 2 


1 1 1 

cot Z » -- sec 2 =* -♦ CSC 2 s* - 

tan 2 cos 2 sin z 


(3-6) 


Using these definitions it is easy to check ^ that all the familiar formulas 
of analytic trigonometry remain valid when real arguments are replaced 
by the complex ones. For example, 

sin^ 2 + cos^ 2 « 1, 


sin (zi + Z 2 ) « sin 2 i cos 22 + cos Zi sin 22 , 

and so on. 

The logauithm of a complex number 2 is defined in the same way as in 
real variable analysis. Thus, 


means that 


** log 2 
2 « e^, 


(3-6) 

(3-7) 


^ See Frob. 1 at tlie end of this seoUon. Also, el. alternative definitions of < sin s* 
and cos 2 in Sec. 17, Chap. 2. 
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where e h the baee of imtural logarithms. Setting w m u + w m (3*7) 
gives 

« tt » e**(coB i> + ^ sin v) (3*8) 

by (3-1). On the other hand, we can write z as 

z ta X + iy ^ r(cos 0 i sin 6 ), 

so that (3-8) gives 

r(cos 6 + i sin 8 ) «* €**(cos v + i sin v). 

It follows from this that 

= r, V ^ 6 + 2kiz, « 0, dhl, d=2, .... (3-9) 

Since u and v are real, we conclude from (3-9) that u Log r, where the 
symbol Log is used to denote the logarithm encountered in real-variable 
theory. We can thus write (3-6) in the form 



tr *= w + » log z Log r -f (^ + 2fcir)f 

(3-10) 

or 

1 y 
log z = - Log (r^ + y^) + i tan~^ -» 

2 X 

(3-11) 


since r « Vx* -f- and 8 + 2k^ =« tan^^ (y/x). 

Thus log 2 has infinitely many values corresponding to the diSTerent 
choices of the arguments 6 of z. Setting fc « 0 in (3-10) and assuming 
that 0 < 0 < 27r, we get a single-valued function 


log z « Log r + St, 0 < 8 < 2ir, 


which is called the principal value of log z. If z is real and positive, the 
principal value of log z equals Log r. 

The definition (3-10) serves to define complex and irrational powers c 
of tlie variable z by the formula 

^ log» (3^j2) 


which is equivalent to the statement that log z® = c log z. Inasmuch as 
log z is infinitely-many-valued, it follows that z^, in general, is an infinitely- 
many-valued function,^ The hyperbolic functions of z are defined by the 
formulas 

1 _ 1 _ sinh z 

sinh z « (6* — 6 *)> cosh z « - (e* + e *), tanh z »« —;—t 

2 2 coshz 


sechz 


1 


coshz 


cschz 


1 


sinhz 


(3-13) 


^ Note, however, that i® is singlo-valued when c is an integer. 
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11iP0e functions are clearly single^valued. The inverse trigonometric and 
inverse hyperbolic functions are defined in the same way as in real-vari¬ 
able analysis, and they are multiple-valued.' 

Emmpk 1. Compute 

On setting x • 1 and y « —1 in the formula (3-1) we get 

— e[cos ( —1) + i sin ( — 1)) 

• e(cos 1—1 sin 1). 

Since COB 1 - 0.54030, sin 1 * 0.84147, and e -> 2.718, 

» 2.718(0.5403 - i0.8415) 

- 1.469 - 1*2.287 

to three decimal places. 

Example 2. Compute sin (1 — i). 

Since 

sin « « i (e‘* - 

and * «• 1 — i, we have 

« M »(1 -») -«-*-*) 

jSt 


2t 


1 -f mn 1) — «*‘'[co8 (—1) -f i sin ( — 1)11 
e 4* 


2i 


■ cos 1 ■ 


2 


“Sin 1. 


We can obtain the same result by making use of the addition formulas of trigonometry. 
Thus, 


dn (1 — i) *» sin 1 cos (—0 4 cos 1 sin (—i) 


But by (3-5) 


a> sin 1 cos i — cos 1 sin i. 
cos i ^ («"'' 4“ «'), sin i « i (e~~' — «'). 

2 2i 


Substitution In the foregoing formula yields the result obtained from the definition of 
sins. 

Example 3. Compute log (I 4- 1 ), 

Since 1 4- ♦ VS [cos (ir/4) 4-»sin (ir/4)], 


tog (1 + ») - Log V2 + (j + 2kir^ i, fc - 0, ±1, ±2. 


by (3-10). The principal value is got by setting Ic « 0. 
Example 4. Compute 2^. 

By (3-12), 

2 » - 

*See Probs. 7 and 8. 
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But log 2 «• Log 2 + i2itk, Heuoe 

2 < ^ 2 . .... 
Example 5. Compute iK 
By (3-12), 

But log t «• Log 1 4- fl(ir/2) 4* Zkr] • i|(x/2) -f 2fcir]» and hence 
t* « g~<x/2+aihr)^ ifc « 0, =hl, d=2, .... 

Example 6. Find all solutions of the equation cos « — 2 *■ 0. 
We have cos a ■« 2» which gives, successively. 



2 


«“ + e~" - 4, 


«*“ - 4«“ + 1 - 0. 

Solving for 

4dbVl6-4 


^ 2 


** 2 db \/3* 

Hence 

ia « log (2 4 \/3) 

and 

a w ^ log (2 4 \/3 ). 


t 


Since log (2 db \/3) is infinitely-many-valued, there are infinitely many values of s. 

PROBLEMS 

1. Verify the following: (a) *» e*i^**; (b) sin^a 4 cos^a — 1; (c) cos (zi 4* aj) 

» cos zi cos Z 2 — sin Zi sin aj; (d) cos iz « cosh a; (e) sin iz » i sinh a. 

2. If a and b are real integers, show that » r®e“^[cos (od 4 6 Log r) 4- 

i sin (a0 4 b I-og r)). 

8. Compute (a) cos (2 4 t), (6) (c) (1 4 1 )\ (d) 2^+*, (e) 

4. Express in the form a 4 bt, where a and b are real: (a) l/(z — 1), (b) l/(a* 41 ), 
(c) sin (1 + t), (rf) e*’, (e) e‘/*. 

6. Find the principal values and represent the numbers graphically: (a) log (—4), 
(fc) log (5i), (c) log (1 + 0, (d) log i, («) »■*, (/) e’+‘, (g) sin 2t. 

6. Find all solutions of the following equations: (o) c* 4 1 « 0; (6) sin a 2 »» 0; 
(c) cos*™^ a « 2; (d) cos a^— 1 » 0. 

7. The inverse functions are defined as solutions of the equation a * f(w) for w in terms 
of a. Thus, w « sm~^a if a ■■ sinix? «« (e*^ — e'~*^)f2i. Obtain in this example 

by solving the equation — 2tae‘*' — 1 ■« 0. The Result is a*" *• wf db Vl ~ a*. 
Hence w *• sin"’^ «• —i log (fa d= V^l — a*). Show in the same way that 

^ t\ i 4 * 

tan ^log - 

2 t — s 


and 008 ”"' a * —»log (a 4: \/if* — 1). 
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«. Meier to Mtob. 7 and show th»t: 

{«) Huih~*« «■ log (t + V** 1), (6) coBh~* f •> log (* + V** — 1 ), 

(c) lath”* < ; log . 

$* For complex numbers o, fc, c in what sense and in what circumstances is it true 
that (aby m 

4. Analytic Functions of a Complex Variable. We say that a point 
s » X + t*!/ approaches a fixed point Zo ^ Xo + iyo if x Xo and y 
Let f{z) be a single-valued function defined in some neighborhood of tie 
point z ro- By the neighborhood of Zo we mean the set of all poirds in a 
mdffimenily small circular region with center at Zq. As 2 : Zq, the function 

/(z) may tend to a definite value wq. We say, then, that the limit of/(z) 
ajs z approaches zo is wq and write 

lim f(z) « 10 ^, 

In particular, if /(zq) « icq, we say that f(z) is continuous at 2 » 

It is not difficult to prove that if f(z) = u(x,y) + ivix^y) is continuous 
at 2o xq + iyoj then its real and imaginary parts u and v are continuous 
functions at (xo,yo), and conversely. 

Let w « fiz) be continuous at every point of some region in the z plane. 
The complex quantities w and z can be represented on separate complex 
planes, called the w and z planes. The relationship w = f(z) sets up a 
correspondence between the points {x,y) in the z plane and the points 
{u,v) in the w plane (see Figs. 6 and 7), so that the corresponding points 
(u,t?) fill some region R' in the w plane. 



Jf Zo » lo + t!/o and 2 = zo + Az are two points in the z plane with 
4z - ^ * Ay, the corresponding points in the w plane are - «o + *0 

and « - ICO + 4w, where Aw e 4« + i Av. The change Aw in the 
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value of wq -« /(«o) oorre^poiuiing to tiie inciemeat Ae in <0 is 
Aw - /(2o + As) - f(zo) 

and we define the derivative dw/dz [or/'( 2 )] by a familiar formula 


f'M 


Aw 
lim — 
-* 0 Az 


lim 

As —e 


f(zo + Az) - f(zo) 
0 Az 


m) 


It is most important to note that in this formula z ^ zo + Az can assume 
any position in the neighborhood of zq and Az can approach aero along 
any one of the infinitely many paths joining z with zq. Hence, if the 
derivative f(zo) is to have a unique value, we must demand that the limit 
in (4~1) be independent of the way in which Az is made to approach zero. 
This restriction greatly narrows down the class of complex functiems that 
possess derivatives. 

For example, if 

ta » z2, 

then on replacing zhy z + Az and ^ by 2 + we get 

w + Aw «« (z + Az)(z + Az) ^ z2 + 2 Az + z Az + Az Az. 


Hence Aw ^zAz + zAz+AzAz 


Aw Az — 

and — ^ z + z - ]- Az. (4-2) 

Az Az 

We show next that this quotient, in general, has no unique limit as Az 
is made to approach zero along different paths. Since z =» a; + iy, 

Az ^ Ax + i Ay, Az » Ax — t Ay 

and we can write (4-2) as 

Aw Ax — i Ay 

— « X — iy -f (x + iy) -h Ax — i Ay. (4-3) 

Az Ax + i Ay 

If we now let Az in (4-3) approach zero along the path QRP (Fig. 8), so 
that first QR « Ay 0 and then PR »* Ax —► 0, we get 

Aw 

lim — «= 2x. 

4« 0 Az 

But if we take the path QR'P and first allow QR' « Ax 0 and then 
R'P » Ay 0, we obtain 

Aw 

lim — » — 2ty. 

At 0 Az 
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Except fm X ^ y ^ Of these liinits are distinct, and hence w ^ z2 has no 
derivative except possibly at z =>> 0, As a matter of fact, it is possible to 
show that this function does have a derivative (whose value is zero) only 
at the point « « 0. 



On the other hand, if we consider 

tp = 


then ttj + Aw? « {z + Az)^ *= 2 ^ + 2r Az + (Az)^, so that 

Aw 2z Az + (Az)^ 

— «- 2z + Az. 

Az Az 


The limit of this quotient as Az —> 0 is invariably 2z, whatever may be 
the path along which Az —► 0. In this example the derivative exists and 
its value is 2z. 

We obtain next a set of conditions which real and imaginary parts of 


w = /(z) s u{x,y) + w{x,y) 

must fulfill if/(z) is to have a unique derivative at a given point z ~ x + iy. 
Since Aw «= An + f Av and Az = Ax + i Ay, we get from (4-1) 


m 


Au 4 -1 Av 

*= hm - 

A* 0 Az 


Au + i Av 

S5S lim -- 

Ax ^0 Ax + i Ay 
Av 0 


(4-4) 


Now, if we let Az —» 0 by first allowing Ay -* 0 and then Ax —* 0, 
we get from (4-4) 



( 4 ^) 
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If, on the other hand, we compute the limit in (4-4) by making first Ax 
and then Ay 0, we obtain 
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0 


(^ 6 ) 


Hence, if the derivatives in (4-5) and (4-6) are to have identical values at 
a given point z for these two particular modes of approach of to aero, 
we must have 

du dv &v du 

_ « -- ( 4 ^ 7 ) 

dx By Bx By 


Equations (4-7) are known as the Cauchy-Riemann equations^ and the 
foregoing calculation shows that they constitute necessary conditions for 
the existence of a unique derivative of f{z) = u{Xyy) + iv{x,y) at 2 =» x + ty. 
These equations also turn out to be sufficient ' if one further assumes the 
continuity of partial derivatives in (4-7) at the point (x,y). 

Complex functions which have derivatives only at isolated points in the 
z plane are of minor interest in applications in comparison with those that 
have derivatives throughout the neighborhood of the given point. We say 
that a function f{z) is analytic (or holomorphic) at a given point z ^ Zq if it 
has a derivative f'{z) al z = Zq and at every point in the neighborhood of Zq. 
It can be shown that the following theorem ^ is true. 

Theorem, A necessary and sufficient condition for f{z) = w(x,i/) + w(x,y) 
to be analytic at ~ ^0 + iyo is that u{x,y) and r(x,y) together itrith their 
partial derivatives be continuous and satisfy Eqs. (4-7) tn the neighborhood 
of {xo,yo). 

The points of the region where f{z) ceases to be analytic are called 
singular points of f{z). 

It is easy to show that familiar rules for differentiating sums, products, 
and quotients of real functions remain valid for analytic functions.* Also 
the formulas for differentiating elementary complex functions, defined in 
Sec. 3, are identical with the corresponding formulas in the calculus of 
re^ variables. We give a derivation of several such formulas in the follow¬ 
ing examples.* 

* A demonstration of this is given in several standard texts. See, for example, E. C. 
Titchmarsh, "The Theory of Functions/' 2d ed., p. 68, Oxford University Press, London, 
1939. 

* This theorem can be deduced with the aid of the strong form of Cauchy^s theorem 
stated in Sec. 5. 

»See Prob. 1. 

< See also Prob. 2. 
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1 . Show that d^/(l$ «*» tf*. 

!£«>•«*«» then the definition (3-1) yields 

te « w + iv e®(cos y -^iemy). 

Her®, ti ■■ e* cos j/, v * s* sin y, and it follows that 


— * £r cos y, 
dX 


£w 

ay 


—€* sin y, 


at; . 

— * e* am y, 


■» €* cos y. 
dy 


Since Eqs. (4-7) are satisfied and the partial derivatives are continuous^ dw/dz can be 
calculated with the aid of either (4-5) or (4-6). Then, 

dw 

-j- «c*co8y-fie*siny 


«■ c®(cos y + » sin y) 

Example 2. Show that (d log z)/dz «* \/z\iz 0. 

The function w » log 2, as noted in Sec. 3, is multiple-valued. However any branch 
of this function got by fixing the value of k in (3-10) is single-valued, and the application 
of Cauchy-Riemann equations (4-7) to it shows that it is an analytic function except at 
2 *» 0. On fixing k we get from ix; » log 2 a single-valued function 

2 « 


whose derivative with respect to w? by Example 1 is 


Hence 



dw d log 2 1 
dz dz z 


a z 9^0. 


The point s *» 0 is a singular point of u? » log s, since the derivative at that point 
ceases to exist. 

Example 3. Show that dz^/dz »» for all values of n (real or complex). 

If tc » 2^, then 

log w * n log 2 . 

On differentiating this with respect to 2 , we get 

1 die n 
w dz z 

’*• dw w 

Baaoe « n - - 

02 2 

since 11 ; z\ This dhrivative ceases to exist at 2 ■> 0 if n <1. 
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PROBLEMS 


1* Showthitt 

(O) ^ (/x ±/.) - /i'W ± m, (6) I (/x/,) - /i/^ + ftfl, 


ic) 


dz 


d) 




whenever /i and /2 are analytic functions. 
2. Show that 


(a) 

(d) 


d(cos 2) 


dz 

d(t>an‘ 


*» — sin 2 , (6) 
"z) 1 


(i(8m z) 
dz 


cos z, (c) 


df% dz * 
c?(t>an z) 


dz 


1 -f*' 


,^<i(8inhz) ^ 

. ^ (e) —_— . cosh z, (/) 


dz 


dz 


■» sec* z, 

« a* log o. 


8. Determine where each of the following functions fails to be analytic: (a) z* -f 2z 
(b) zf(z -f 1), (r) 1 /z -h (2 ~ l)^ (d) tan z, (e) l/[(z - l)(z + !)],{/) zi, {g) e*, {h) / 

- 2ixy, (i) x/(x2 4- 2/®) -f 4 I/*), 0) U|, (k) tan""^ z, 

6. Integration of Complex Ftmctions. Cauchy’s Integral ilieorem. We 

define the integral dz of a complex function f{z) = u{x^y) 4 iv{x^y) 

along a path C in terms of real line integrals as follows: 


f{z) dz ^ (u + iv){dx + i dy) 


= J^(u dx - V dy) + (vdx + u dy). (5-1) 

Reiil integrals of this type were studied in Chap. 5, Sec. 4, where it w^as 
observed that they exist when the fundions u(x^y) and v(x,y) are continu¬ 
ous and the path C is sufficiently smooth. 

The integral in (5-1) can also be defined in a manner of Sec, 4, Chap. 5, 
by the formula 

r ^ 

I f{z) dz E lim 23 /(f.)(z. - z«-x). (5-2) 

W CO 

It is supposed that the curve C is divided into n segments by points Zi 
and that f» is some point of the ith segment* The limit is then computed 
as tlie number of segments is allowed to increase indefinitely in such a 
way that the length of the largest segment tends to zero. The fact that 
the definitions (5-1) and (5-2) are equivalent follows from consideration 
of Sec. 4, Chap. 5. 

As an illustration of the use of formula (5-1) consider the integral 

j^fdz, ( 5 - 3 ) 
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where the path C is a straight line joining the points « » 0 and =» 1 *f 
2t (Fig. 9). Since = (x — iy)^ « x^ - - i2xy, we get, on sub¬ 
stituting u ^ — y^yV — 2xy in (5-1), 

jf ^ d 2 « ^ [(x^ - 2 /^) dx + 2xy dy\ + [-2xy dx + (x® — y^) dy]. 

(5-4) 



But the cartesian equation of C is 2 / = 2x, and hence (5-4) can be reduced 
to the evaluation of two definite integrals: 

jz^ dz = ^ 5x^ dx + — lOx^ dx = % — i'^%. 

The value of the integral (5-4) depends on the path C joining the given 
points 2 = 0, 2 = 1 -1- 2i, for according to Sec. 9, Chap. 5, a necessary and 
sufficient condition that the line integral 


f Mdx + Ndy 
Jc 


(5-5) 


be independent of the path in a simply connected region R is that 


dM ^ ON 
dy dx 


(5-6) 


throughout R. We further recall tliat in deducing the condition (5-6) 
with the aid of Green’s tlieorem it was supposed that Mix^y), iV(x, 2 /), 
and their partial derivatives in (5-6) are continuous functions throughout 
the region. It is readily checked that Eq. (5-6) is not satisfied by the 
functions appearing in the line integrals in (5-4). 

If, however, J{z) ^ u + iv in (5-1) is an analytic function, then the 
Oauchy-Riemann equations (4-7) demand that 


du dv dp du 

dx dy dx dy 


(5-7) * 
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Reference to (5-6) «3iows that these conditions are precisely those that 
ensure the independence of the path of the line integrals in (5-1), provided 
that the partial derivatives in (5-7) are continuous functions in the given 
simply connected region R, Thus, if we suppose that/( 2 ) is analytic in the 
given simply connected region and /'(«) is continuous there, then the 
integral 

jj{z)dz 

is independent of the path joining any pair of points in the region. If 
the path C is closed, then the value of this integral is zero. We thus 
have a theorem, first deduced by Cauchy, which is of cardinal importance 
in the study of analytic functions. Although the foregoing proof assumes 
the continuity of /'(z), the theorem can actually be established^ under 
the sole hypothesis that f{z) exists at each point of the region, and we 
state it in this strong form. 

Cauchy's Integral Theorem, If f(z) is analytic at all points within 
and on a closed curve C, then j^f(z) dz = 0. 

We conclude this section by deducing, from definition (5-2), a useful 
inequality furnishing an upper bound for the value of tlie complex integral 

^/(s) dz. Inasmuch as the modulus of the sum of complex numbers is 
never greater than the sum of the moduli, 

\j^mdz\<j^\f{z)\-\dz\. 

Now, if the modulus 1/(2) | of f{z) along C does not exceed in value some 
positive number M, then 

I d2 j ^ ~ ^ ~ 

where L is the length of C. 

As an illustration of the use of the inequality (5-8) we apply it to deduce 
an upper bound for the integral (5-3). The modulus of takes its maximum 
at the point 2 = 1 -f* 2i. Hence we can take M in (5-3) as j 1 + 2^p =« 5, 
and (5-8) then yields 

I ^ 2 ^ d2 j < 5^/5, 

inasmuch as 1/ =« y/d for the rectilinear path in (5-3). 

* See Titchmarsh, op. ciL, pp. 75-83. In a somewhat different development of the 
subject one deduces the continuity of /'(z) from Cauchy’s theorem, not the other way 
about. We shall see in Sec. 7 that the theorem actually implies existence and continuity 
of derivatives of all orders. 
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PHOBLBMS 


!• Find the vftiue of the integral { z^dz along the rectilinear path joining the points 

Jc 

g m 0 and « «■ 2 + t. Show that this integral is independent of the path. 

3L Find the value of the integral f 2 dz along the rectilinear path y ^ x joining the 

Jc 

points (0,0) and (1,1) and also along the parabola y « x** joining the same points. 

$, Show that the integral f S dz evaluated over the path [zj « 1 in a counterclock- 

Jc 

wise direction yields 2iri. Note that z « and t *» along the path iz| h 

f^z — 1 

4. Find the value of the integral / - dz, where the path is the upper half of the 

7-1 z 

circle |z| •• 1. Calculate the value of tliis integral over the lower half of the circle 

1*1 

6, Show that / (1 *4" **) dz ia independent of the path C, and evaluate this integral 

Jc 

when C is the boundary of the square vdth vertices at the points z*«0, z«»l,«»l-ft, 
and z i, 

6. What is the value of the integral j e*‘ dz where C is the boundary of the square in 

Prob. 6? .f 

7 . Find the value of the integral / e‘ dz over any path joining z » 0 and z « ri. 

Jo 

S« Use formula (5-8) to show that: 

I I { dz 1 I /** I 

(a) j z^ dz I < 10, (^) I j 2 * I \ j dz j < 2, 

where paths are straight lin^is joining the points appearing in the limits of these integrals. 


6. Cauchy’s Integral Theorem for Multiply Connected Regions. In 

establishing Cauchy’s integral theorem in the preceding section we assumed 
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that the region bounded by the 
curve C is simply connected. It is 
easy to extend this theorem to 
multiply connected domains in the 
manner of Sec. 9, Chap. 5. Thus 
consider, for definiteness, a doubly 
j connected region (Fig. 10) bounded 
I by closed curves Ci and C 2 , where 
/ C 2 lies entirely within Ci. We 
assume that f{z) is analytic in the 
region exterior to C 2 and interior to 
Cl and analytic on C 2 and Ci. The 
requirement of analyticity on Ci and 
C 2 implies that the function fiz) is 


analytic in an extended region (indi¬ 
cated by the dashed curves Ki and K 2 ) that contains the curves Cj and Cg. 
If some point A of the curve 0% is joined to a point B of C 2 by a crosscut 
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AS, then the regkm becomes simply connected and the thcomn of Caudiy 
is applicable. Integrating in the positive direction gives 

^AFA Lb ^BQB La 

where the subscripts on the integrals indicate the directions of integration 
along Cl, the crosscut AB^ and Cs* Since the second and the fourth 
integrals in (6-1) are calculated over the same path in opposite directions, 
their sum is zero and one has 

f{z) dz + /(z) dz « 0, (6-2) 

where the integral along Ci is traversed in the counterclockwise direction 
and that along C 2 in the clockwise direction. Changing the order of 
integration in the second integral in (0-2) gives 

/(z) dz « f{z) dz, (6-3) 

We see that the values of the integral of f{z) over two different paths 
Cl and C 2 are equal, but they need not be zero inasmuch as f{z) may not 
be analytic at every point of the region bounded by C 2 . But whatever 
may be the value of the integral over the path C 2 , it is the same as its 
value over the path Cj. An important principle of the deformation of 
contours follows at once from this observation: The integral of an analytic 
function over any closed curve Cx has the same value over any other curve C 2 
into which Ci can he continuously deformed without passing over singular 
points of f(z). 

We shall see that this principle will enable us to simplify the computation 
of integrals of analytic functions. 

The foregoing results can be extended in an obvious way to yield the 
following theorem: 

Theorem. If f{z) is analytic in a closed multiply connected region hounded 
hy the exterior curve C and the interior curves Ti, C 2 , ..., then the in¬ 
tegral over the exterior curve C is equal to the sum of the integrals over the 
interior curves provided that the integration over all the contours is performed 
in the same direction. 

It should be noted that the requirement of analytic!ty of f{z) in the 
closed region implied that f{z) be analytic on all contours forming its 
boundary. 

Before considering applications of the theorem of this section to specific 
problems, we deduce an important result which will enable us to compute 
many integrals by a method which is vastly simpler than that developed 
in Sec* 5* 
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7. The Ftmd«m«atal Theorem of hitegral Celculue. Let /(z) be aaal 3 ^c 
in & simply connected region R (Fig. 11), and let C be a curve joining two 

points Po and P of the region determined 
by the complex numbers zq and z. We con¬ 
sider the integral 



rmdz 

j to 


(7-1) 


along C. Since f{z) is analytic, the integral 
(7-1) is indept'ndent of the path, and its 
value is completely determined by the choice 
of zo and z. If zo is fixed, the integral (7-1) 
defines a function 

F(z) = r/(2)dz (7-2) 

for every choice of z in R. 

To emphasize the fact that the integration variable z plays a distinct 
role from the variable z appearing in the upper limit of the integral, we 
can rewrite (7-2) as 


F(z) 


Jzo 


(7-3) 


We prove next that F(z) is an analytic function and^ moreover, its 
derivative at any point z has the value of the function in the integrand 
at that point. That is, 

F\z) ^f{z). 

We can use (7-3) to compute the difference quotient 
F{z + Az) — F(z) 

Az 


Az 


1 ri+A* 

-/. m^. ( 7 . 4 ) 


and rewrite (7-4) by adding and subtracting/(z) in the integrand: 
F(z Az) — F(z) 1 f*+A» 

—- . . I — ... sa - 

Az Az 

1 

j, ' Az 


[/(r) -/(*) 4-/(z)]df 


/•jp+As 1 rg-fAv 

■m (/(f) 

; Ay Jz 
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dt ** Azj so that 


F(z + Az) — F(z) 1 

^ = /(2) + _ / [/(f) -/(2)] dr. 

A* *'* 




Now if 


lim ^ -/(2)ldf-0, 

A* 0 Az 


(7-6) 

(7-6) 


then it would follow from (7-5) that F'(z) « f(z). The fact that the limit 
in (7-6) is, indeed, zero follows at once from the estimate (5-8), for if 
M - max !/({*) — f(z) | on the path joining z and Az, then 


1 rz^dit 

- ] [/(f)-/(«)ldf 

Az 


< Af. 


But since f(z) is continuous, Af 0 as Az —> 0. 

Any function Fi{z) such that F[iz) ^ f(z) is called a primitive or an 
indefinite integral of /(z). As in real calculus, it is easy to prove that if 
Fi{z) and F 2 (z) are any tw^o indefinite integrals of /(z), then they can 
differ only by a constant.^ 

Hence, if Fi{z) is an indefinite integral of /(z), it follows that 
F{z) - ffiz) dz = F,{z) + C. 

JZ9 

f*0 

To evaluate C, set z » z©; then, since / /(z) dz » 0, C = Thus 

Fiz) = (‘/(z) dz = Friz) - Frizo). (7-7) 

J Zii 


The statement embodied in (7-7) establishes the connection between line 
and indefinite integrals and is called the fundamental theorem of integral 
calculus because of its importance in the evaluation of line integrals. It 
states that the value of the line integral of an analytic function is equal to 
the difference in the values of any primitive at the end points of the path of 
integration, 

> Proof: Since F[(z) » F^^z) «« /(z), it is evident that 


F[{z) - F^iz) 


d(Fi - Ft) dO ^ 

- B - a* 0. 

dz dz 


But if dG/dz ** 0, it means that 0\z) -■ (du/dx) -f i(dv/dx) » (Bv/dy) — Hdu/By) •» 0, 
so that Bu/Bx *• Bv/Bx - Bu/By «• Bv/By « 0, and thus u and v do not depend on x 
and y. 
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Mwampile I* As lua iUustratiini of the use of formula (7-7) ooueider the e'^ahiation oi 

fz^d^ m) 

JC 

along some path C joining * «« 0 and i 2 t. Inasmuch as /(*) *• ** is analytic 
throughout the finite « plane, the integral (7-8) is independent of the path. Moreover, 
aiiice F(f) ■■ is an indefinite integral for f(z) ■■ 2 ®, we can write 

|2-M I 


r 


+* 1 
z^dz •* r I 


:(2+t)®. 


The reader should contrast this computation with calculations required for solving this 
in Prob* 1, Sec. 5, 

Examj^ 2. Evaluate I e^dz over some path C joining « - 0 and * * xi. Since c* is 
JC 

analytic, we get at once from (7-7) 


r 


«*cte ’ 


- 1 - -2. 


We indicate the nature of required calculations if this integral were to be computed 
by the method of Sec. 6. We first separate the integrand into real and imaginary parts, 

•« ■■ 6* cos y + sin 1 /, 

and form two real line integrals 

I e*dz ^ / (c* cos y + sin y)(dx 4* i dy) 

JQ Jo 

■■ y^(c* cos y di — «f* sin y dy) + t j sin y dx + c* cos y dy). 


Since these line integrals are independent of the path, they may be evaluated over any 
convenient path joining the points (0,0) and (0,7r) corresponding to r « 0 and z « xi. 
The result of such calculations would yield -2, as the reader can verify. 

Mxampk 3, Discuss the int^igral j(z — a)^ dz, where m is an integer and o is a 
constant. 

The function f(z) - (z *- a)*" is obviously analytic at all points of the z plane as long 
as m is a positive integer. If wi < 0, we write w « -~n and consider 




1 


where n is a positive integer. 


(z ~ a)” 


To evaluate 


/ (z — o)*" dz for m> 0, we note that 
(z - 

F(z) - ^ ^ 


w 4* 1 

is an indefinite integral for/(z) (z - a)”*. Accordingly 

Jh m + I L 


(7-9) 


(7-10) 


If, in particulM, the path C is closed, so that the Umits in (7-10) coincide, we conclude 
that the value of the integral is zero. This result also follows from Cauchy’s theorem, 
rinee/(*) » (* — a)” i»«analyUc for all values of t when m > 0. 
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and note first that if the path C passes through the point ir » a, the integrand becomes 
meaningless at z •* a. In this book ^ we shall not consider in derail integrals over those 
paths that go through singular points of the integrands, but special types of such inte¬ 
grals will occur in Sec. 22. 

If C is a closed path and a is not in the region R enclosed by C, the integrand in (7-11) 
is analytic in the closed region R. Hence, by Cauchy’s theorem the value is zerp. If, 
however, a Ue.s in R, Cauchy’s theorem 
does not apply, since /(z) « 1/(2 — a)" 
ceases being analytic at z « n. 1 'he into- f 

gral (T-ll) can, of courst^, be evaluated by I 

the method of Sec. 5 once the equation of y \ 

C is specified. However, it is wise to sim- \C 

plify calculations by making use of the J ✓—\ 

principle of deformation of contours. This / / \ 

principle states that when z — o is in the ® y ^ 1 

interior of C, / ^ 1 


*c(z~o)" Ty{z--a)^ 


where 7 is a circle with center at a and 

with radius p so small that 7 liiis within C q- 

(Fig. 12) But the integral over 7 is easily 
evaluated. Setting z ^ a ^ pc‘^ w^e get 
dz « pe^t dd on observing that p is constant on 7 . Hence 
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^ dz pe^^i de _ i 

9c (r-^r ” 9y i 


Tciz-ar T 

If n »■ 1, we get 




g(l—2r 

U(i -T) 0 


if n 1. 


[--if 

Jez - a Jo 


de « 2W. 


In evaluating the integral (7-12), we noted that the integrand for n 1, is 

the differential of ^^^”*"^^ 7^(1 — ri), and we made use of the fundamental theorem of 
integral calculus. 


f dz 

Example 4. Evaluate the integral / - 5 ——, where C is the circle -f w® « 4 . 

./<7Z* — 1 


The function 




1 1 
2* ~ 1 ® (2 - 1)(2 + 1) 


(7-14) 


^ When 2 w a lies on the path of integration, the integral in ( 7 - 11 ) is an improper 
comiilex integral and it calls for special considerations analogous to those required to 
treat improper real integrals. Certain types of improper complex integrals are of in¬ 
terest in applications. See, for example, N. I. Musklielishvili, “Singular Integral Equa¬ 
tions,” P. Noordhoff, N,V., Groningen, Netherlands, 1953. 
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hai two singular points « — I and « ■* — both of which lie within the given circle 
1*1 < 2 (Fig, 13). If we delete these points ifrom the circular region C by circles yi and 
yt of sufficiently small radii, /(*) will be analytic in the triply connected domain exterior 
to 71 and yt and interior to C. Then Cauchy^s theorem for multiply connected domains 
permits us to write 

f /(*) f /(*) dz^f m dz. (7-16) 

Jc Jyi jy% 



The integrals in the right-hand member in (7-15) are readily evaluated. Since 


we get 


1 _ 1 1 i 1 

(s ~ 1)(2 -h 1) 2z - 1 2 7+7' 


i. 


1 


(z ~ l)(z + 1) 



1 f dz 

2 y-yj z + 1 


(7-16) 


If the radius of 71 is such that 71 contains within it z » +1 but not z » — 1 , then 


by (7-13), and 


L 

L 



dz 


2x1, 

0 , 


by Cauchy's integral theorem, for l/(z + 1 ) has no singularities within 71 . 
first integral on the right in (7-15) has the value xt. An entirely similar 
shows 


Thitrefore, 


i {* - l)(z + 1 ) *-(«)+(-«•)- 0 , 


Thus, the 
calculation 


einen though the integsand is not analytic in the region 1*1 2 . 
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PROBLEMS 

1 . Show that f zdz ^ — «g) for all pathii joining zo with *. 

-'So 

2 . Evaluate the integral / (2 — a)‘“^d 2 , where C is a simple closed curve and o 

Jc 

is interior to C, by expressing it as a sum of two real line integrals over C. Hint: Set 
2 — a •» then dz «« e^*(<ip ip d6). 

3. Evaluate / 2 “"^ ds where the path C is the upper half of the unit circle whose 

Jc 

center is at the origin. What is the value of this integral if the path is the lower half of 
the circle? 

4 . Evaluate / 2 ^“* d 2 , where C is the path of Prob, 3. 

Jc 

6 . P>aluate / (z^ — 22 -f 1) dz, where C is the circle x* -f y* ■» 2 . 

Jc 

f z Hh 3 

6 . Discuss the integral / — 5 — dz , where C is a path enclosing the origin. 

Jc 

7. What is the value of the integral / (1 -f- dz, where C is the circle a:* + V* “• 

Jc 

8 . Discuss Prob. 7 by noting that 

1 


97 


1 -f z* 


i(_i- 

2 i\z — t 2 4* i/ 


and evaluating the integrals over the unit circles whose centers are at 2 » » and 2 *• “-t. 
Note the theorem of Sec. 6, 

9 « Show that the integrals (a) f ~ , (6) f mxzdz, (c) f ze* dz, (d) (dz vanish 
Jc z — 2 Jc Jc Jc 

if C is the unit circle \z\*» 1. 

10. Evaluate the integral f — ■-d 2 along the following paths C: (o) | 2 | « 

Jc 1—2 

(b) |2| «* 2, (c) 1 2 — 11 “ 1, (d) 1 2 4* 1 1 « 1. Hint: Decompose the integrand into partial 
fractions as in Prob. 8. 


8 . Cauchy’s Integral Formula. In this section we deduce with the aid 
of Cauchy^s theorem the remarkable fact that every analytic function/(«) 
is completely determined in the interior of the given closed region R when 
the values of f{z) are specified on its boundary. 

Let f{z) be analytic in a simply connected region R and on its boundary 
C. If a is an interior point of R^ then the function 


m 

z — a 


(8-1) 


is analytic in R with the possible exception of the point g « a. If this 
point is excluded from the region by enclosing it in a circle 7 of radius p 
and with center at a (Fig. 12 ), then ( 8 - 1 ) will surely be analytic in the 
region exterior to 7 and interior to C. 
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It fdloWB, tibffla, fHm (6-3) that 

m 

' z 


Jc z ^ d "'y 


m 


dz 


(8-2) 


where the paths C and y are described in the same sense. Now the integral 
in the right-hand member of ( 8 - 2 ) can be written as 


Jy z — a ■'y 


m -m 


2 — a 


dz 


dz 


But by (7-13) 


fy z 


dz 


= 2?^, 


(8-3) 


(8-4) 


and we sliall show next that the first integral on the right in (8-3) has the 
value zero. Indeed, if w^e take z — a - then, as long as z is on 7 , 
dz — ipe'^ d$f and therefore 


/ — - - dz^ if Ifiz) - fia)] dd. 

■^7 z — a ■'7 


Let the maximum of \f{z) — f{a) | be M ; then by (5-8) 


/, 


f{z) - fia) 


dz 


/•2t 

< M / dB 
Jo 


2irM. 


(8-5) 


( 8 - 6 ) 


The radius p is arbitrary, and if we make it sufficiently small, then 
max \f{z) — /(a) | can be made as small as we wish, since/fz) is a continuous 
function. Accordingly, Af 0 as p 0 On the other hand, from the 
principle of deformation of contours, the value of the integral (8-6) is 
independent of the radius p. Since M 0 when p 0, we conclude 
that the value of the integral (8-5) is zero. 

Accordingly, (8-3), together with (8-4), gives the result 


r m 
z — a 


dz — 27r//(a). 


(8-7) 


We recall that the point a is any interior point of the region R bounded 
by C and z is the variable of integration on the contour C. If we denote 
the variable of integration by f and let z be any interior point, we can 
rewrite formula (8-7) as 



/(r)dr 
f - 2 


( 8 - 8 ) 


Fmnula (8-8) permits us to calculate the value of /(«) at any interior 
ptihit from specified: boundaiy values /(f) on the contour C. It is known 
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as Cawchy^B integral formula. This formula can be extended in the manner 
of Sec, 6 to multiply connected domains bounded by the exterior contour 
Co and m interior contours Ci, Cn». The integration in (8-8) is 

then performed in the clockwise sense over the interior contours and 
counterclockwise over the exterior contour Co* 

It is not difficult to show with the aid of formula (8-8) that an analytic 
function /(«) has not only continuous first derivatives in the region but 
also derivatives of all orders. Thus an analytic function can be differen¬ 
tiated infinitely many times. 

In fact, if we consider an integral of Cauchy^s type^ 


F{z) 


1 

2x1 


i icf 


m 




(8-9) 


where /(f) is any continuous (not necessarily analytic) complex function, 
then this integral defines an analytic function F{z), To show this we 
merely have to prove that F{z) has a derivative at every point of the 
region R bounded by C. We form the difference quotient with the aid of 
(8-9) and get 


riz) 


lim 

A* 0 


F{z + Az) — F{z) 
Az 


to 

0 As L 2x1 ^ — {z + Az) 2x1 ■'c f ^ J 

to [1/--1. 

A* 0 12x1 JC z — Az)(f — z)\ 


On taking the limit as A^ 0 under the integral sign, which is legitimate 
if /(f) is continuous, we get 


f'M 


-/ 

2W (f 

Ccmtinviing in the same way, we find 


/(f) 


zy 


;dl. 


F"{t) 


IL 

2« ‘c (f - z)3 




r m 

2iri Jc (f _ j)"+i 


We have thus shown that F(z) defined by (8-9) has derivatives of all 
orders even when nothing is said about the relation of the values of F{z) 
on the boundary C to the function /(f) appearing in the int^and. In 
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tlM case when fQ;) «= F(^), we have a formula for the wth dfflivative 

of the aiuklsrtic functicm /(*) at any interior point of iZ in terms of the 
values of /(«) on C: 


/<->(*) 


nl r _^ 

2xi ic (t - 


/(f) 


2« Jc (f - z)«+i 


df, 


0 , 1 , 2 , .... 


( 8 - 10 ) 


We cotLelude this aection by noting some important consequences of formula (8-7). 
Let the path C be the circle |— o | « p with center at r •*» a and with radius p. Sup¬ 
pose that the maxiinum value of the modulus of f(z) on this circle is M; then by (5-8) 


l/(a)l<~-2rp 
2r p 


M, 


This result is independent of the radius p. Consequently |/( 2 ) | at the center a of the 
circle is not greater than its maximum value on the boundary. Using this result one can 
prove that if /(*) is analytic in a given region jfi bounded by a curve C\ and if Af is the 
maximum value of \f{z) | on C, then \f(z) ] < M at each interior point of R unless 
l/(*)l** ^ throughout the region. This result is known as the maximum modulus 
theorem.^ The fact that l/(r)|< M follows from Sec. 24, Chap. 6, if we note that 
log \f(z) I is harmonic. 

Example 1. Find the value of the integral / —™ dz if C is the ellipse -f 4y^ ** 1. 

Jc * 

Since sin z is analytic in the region bounded by C, formula (8-7) yields, upon setting 
/(*) sin s and a » 0, 


/sin z 

Jc * 


Example 2. Evaluate the integral J 


Jcz-f-l 

The point « «• — 1 lies within the given circle, and since e 
formula (8-7) yields 

/ e'~* I 

I —r*T * »* e2jrt. 

Jez + I _1 


dz over the circular path | z | « 2. 

is analytic within C, 


Example 3, Find the value of the integral 

tan z 


L 


-^dz, 


Jc Iz - (ir/4)p 

where C is the circle |s| •» 1. 

The point z • ir/4 lies within C, and we note that tan z is analytic for jz | < 1. From 
( 8 - 10 ) 

Hence 


idz. 


2in J (z o)* 

/ tan * - « . /d tan z\ „ x 

___ * . 2« - 2« sec* - - 4«. 




‘See proof, for esuqpte, in E. C. Titchmarsh, “The Theory of Functions,” 2d ed., 
p. 164, Chdord Univentty Preat, London, 1030. 
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PROBLEMS 


1. If /(*) 


■/c 


3r® + 7f +1 


(if, where C is the circle of radius 2 about the origin, 


Jc r - « 

find the value of /(I — i). 

2. Apply Cauchy’s integral formula to Prob. 7, Sec. 7. Use the integrand in the 
form given in Prob. 8, Sec. 7. 

3 . Evaluate the following integrals over the closed path C formed by the lines x • d=l, 

f sin z f cos z t * t 

»-±l; (o) /-(6) /- dz, W / -— (A) ((mnz + endz, 

Jc ^ Jc ^ Jc ^ ^ w Jc 

r eosh z , 

(e) - - dz. 

Jc 2 

4 . Evaluate with the aid of Cauchy’s integral formula 


X 


3f'*f r 




1 




where C is the circle [f | — 2. HirU: Decompose the integrand into partial fractions. 

5. What is the value of the integral of Prob. 4 when evaluated over the circle | f — 11 
» 1? Hint: Note that (3f® + f)/(f + 1) is analytic for jf — 11 < 1. 

1 * 32 ^ 4 " 22 — 1 

6 . Evaluate /- dz, where C is the circle \z\^ 1. 

Jc 2 

7. Can 1/(2) 1 assume a minimum value at an interior point of a region within which 
f{z) is analytic? Consider fiz) » 2. 

8 . Can 1/(2) I assume a non2ero minimum at an interior point of a region within which 
f(z) is analytic? Hint: Consider l//(2). 


9. Harmonic Functions. We saw in the preceding section that a function 
analytic at a given point of the region has derivatives of all orders at that 
point. It follows from this that the real and imaginary parts of an analytic 
function f{z) ^ u + iv have partial derivatives of all orders throughout 
the region where f(z) is analytic, for by (4-5) and (4-6) 


fiz) 


du dv 

-h i — 

dx dx 


and since/'( 2 ) is also analytic, 


riz) 


- -1~ i - 

dx^ dx^ 


d^v 


dv 

dy 

d^u 


du 


d^U 


dh 


dx dy dx dy dy^ dy^ 


The fact that f\z) is analytic enables us to differentiate again to obtain 
the third partial derivatives, and so on. 

Inasmuch as the existence of the third partial derivatives ensures the 
equality of mixed partial derivatives of the second order, we can show 
that the real and imaginary parts of an analytic function satisfy Laplace's 
equation throughout the region of analyticity of Siz)\ for on differentiating 
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tile first of Cattdiy>lUemann equations (4-7)with respect to y and the second 
vith respect to x, we get 


dy dx 

and adding these we find 


dh 


d^v 




0 . 


The fact that u also satisfies Laplace's equation 

0 


d*w 


d^u 
dx dy 


follows similarly from the differentiation of the first of Eqs. (4*7) with re¬ 
spect to X and the second with respect to y. 

Any real function uix^y) with continuous second partial derivatives 
which satisfies Laplace^s equations in a given region is called harmonic 
in that region. Thus the real and imaginary parts of a function analytic 
in the region R are harmonic functions. Two harmonic functions uix^y), 
v(x,y) such that u + iv is an analytic function/( 2 :) are said to be conjugate 
hxirmonics. We sliall show next that if one harmonic function is given, its 
conjugate harmonic can be determined to within a constant of integration. 
For, let u{x^y) be given in R. Then if v(x,^) is a conjugate harmonic, these 
functions satisfy the Cauchy-Riemann equations 



du dr 

du 

dv 




dx dy 

^ ~ 

dx 




dv 

dv 


du 

du 

Hence 

dv — 

dx - dy 

ss — 

— dx A - dv 


dx 

dy 


dy 

dx 

and, since du/dx and du/dy are known from uix^y)^ we have 



r{x,v) / 

du 


du \ 


v{x,y) 

= /, . (- 

■ — dx + 

— dy]^ 





dx / 


where the integral can be evaluated over any path joining an arbitrary 
point (a:o,^o) of R with (x,y). Since the value of the line integral (9-2) 
depends on the choice of (xo,2/o)> it is clear that v{Xyy) is determined only 
to within an arbitrary constant. The integral is independent of the path 
inasmuch as 

dy \ dy/ dx \dx/ 

^Bd this equation is trae because u{x,y) is hannonic. It should be noted 
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hat when the r^on E is not simply connected, the function vix^y) may 
um out to be multiple-valued.^ 

The connection of analytic functions with Laplace^s equations is one 
>f the principal reasons for the importance of the theory of functions of 
omplex variables in applied mathematics. 

In the preceding section we noted the maximum modulus theorem for 
uialytic functions. This theorem enables us to prove the important fact 
hat the maximum values of harmonic functions {which are not mere constants) 
ire invariably assumed on the boundary of the region. 

Let u be harmonic in the region R whose boundary is C. If i> is a con- 
lugate harmonic, then u + iv is an analytic function, and therefore the 
function 

gw-fiv _ e“(co 8 1; + f sin v) 


[s also analytic. But the maximum of is assumed on the 

boundary C of by the maximum modulus theorem. Since c“ takes on 
its maximum on the boundary C, u{Xjy) must assume its maximum on C, 


Example- Tlie function u » x* — 2/^ is harmonic in every region. Obtain a conjugate 
harmonic v. 

Inserting u in the formula ( 9 - 2 ) yields 

Ax.v) A*,v) 

v{x,y) *« / {2y dx -\r2xdy) ** 2 / d{xy) « 2xy -f c, 

•'(Xo.J/o) ■'(xo.vo) 

where c * — 2 xoyo, 

In tliis problem the integrand is so simple that we wrote its differential by inspection. 
In a mort! complicated case it may prove more expedient to evaluate the integral over 
some convenient path rather than reduce the integrand to the form of a differential of 
some function. 


PROBLEMS 

1 . Prove that v ** 3 x®y ~ y* is harmonic, and find a conjugate harmonic u. 

2 . Find an analytic function/{«) «* u -f* ie if; 

(o) xt ** x; 

(5) u •« cosh y coax; 

(c) u « x/(x^ + y^); 

(d) u » cos y ; _ 

(<?) u « log Vx® -f* y^- 

10. Taylor’s Scries. In this section w^e are concerned with the power- 
series representation of analytic functions. The reader is advised to review 
Secs. 8, 9, and 16 of Chap. 2 dealing with the properties of power series. 

eo 

Here we recall that when the power series 53 converges for z «« zi, 

k-fO 

it converges absolutely and uniformly in every closed circular region 
\z\<T, where r <\zi\. A circle of radius r such that converges for 

»See Sec, 6, Chap. fi. 
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la^l < r and div^^rges for every |rl > f is called the circle of convergence^ 
and the number r is the radius of convergence. The radius of convergence 
can frequently be determined with the aid of the ratio test. Thus 


lim 


whenever this limit exists.^ 


On-l 


** (—l)V 

Example: The series ^ -has the radius of convergence r 


1 , since 


lim 


On I 

eo 

The series 2^ nis** converges only for z * 

n^mO 

Um 


lim 


n— ♦« n 

0 , since in this case 


lim - ■ 

n—» w 71 


= 0. 


On the other hand, the series 2J converges for all values of r, since 

n-fO nl 


lim 

n —♦ w 


On 


« lim 


Too (n — 1)! 

We saw in Sec. 9, Chap. 2, that with every real function f{x) having deriv¬ 
atives of all orders at a given point x == a, we can associate the power series 

00 

X) a»(* - o)" 

n«BsO 

with On = f^Ha)/n\ which usually converges to f(x) in some interval 
about the point x = a. However, the existence of infinitely many deriva¬ 
tives at X == a does not ensure the 
convergence of the series 2Jan(x — o)"^ 
to/(x). To ensure convergence, the 
remainder in the Taylor formula (9-1) 
of Chap. 2 must approach zero. 

Inasmuch as every function f{z) 
which is analytic at 2 = a has in¬ 
finitely many derivatives at that 
point, we can write down the series 
* 

L—r(2-ar 

which converges in some circular 
region [z ~ a| < r. The question is: 
Does such a series invariably converge to /(z)? 

We prove next (in contradistinction to the situation with the correspond- 

> See See. 8, ChAP< 
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iog real aeries) that aiMil3rtic functions can always be represented by power 
series. 

Let f(z) be analytic in some region E, and let C be a circle lying wholly 
in R and having its center at a. If z is any point interior to C (Fig. 14), 
then it follows from Cauchy’s integral formula that 




_1_ r m) 

2ri 'C f — z 




±fM- t ^ 

2in f — a 1 1 — (i; — o)/({' — J 


But by long division 


( 10 - 1 ) 


1 - t 1 - « 


and substituting this expression with < = (z — a)/(f — o) in (lO-l) leads to 

.. s 1 r f /(f) ^ r /(f) .. . 

„ (z - a)" 


where 


i 


/(f) 


2Tt- Jc (j- _ a)»(f - z) 


df- 


Making use of (8-10) gives 
m = /(a) + /'(a)(r - a) + (z - a)^ 

+ ■ • • + T—(^ - “)" + 

(n - 1)! 


By taking n suflRciently large, the modulus of i?„ may be made as small 
as desired. In order to show this, let the maximum value of j/(f) | on C 
be M, the radius of the circle C be r, and the modulus of z — a be p. Then 
If — r|>r — p, as shown in Fig. 14, and 


|fi»l 


|z - q|" 

2x 


fc (I- — 


/(f) 


^ (f - a)’*(f - z) 


df 


^ p" M2irr Mr /pV 
~ 2x r“(r — p) r — p \r/ 

Since p/r < 1, it follows that lim | jB„ j >= 0 for every z interior to C. Thus, 
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one cab write the infinite aeries 

f'Ha) (o) 

m - m + r(a)(z - a) (* - o)®+• • • +'—— (*-«)"+••• 

(i(w) 

which converges to f{z) at every point « interior to the circle \z — a\ « r. 
The series (10-3) is the Taylor series of fiz) expanded about the point 
* « a. As in Chap. 2, Sec. 9, one c^n prove that the representation (10-3) 
is unique. 

Let 2 =“ 2 o be the singular point of/( 2 ) nearest z - a; then f{z) is analytic 
in the circular region 1 2 — a j < ro, w'here ro = 12o — a |. This circular 

region will then be the circle of 
convergence of the series (10-3) in¬ 
asmuch as the series diverges for 
|2 — a| > ro. It should be noted, 
however, that there may be points 
of the region R where f{z) is analy¬ 
tic which lie outside the circle of 
convergence of this series. How¬ 
ever, one can always choose a new 
point a about which tlie expansion 
is performed so that the circle of 
convergence of Taylor’s series about 
that particular point contains with¬ 
in it the desired value of z as long 
as f{z) is analytic at 2 . In this 
manner the region R can be covered 
by a set of overlapping circles each 
of which is associated with some 

For example, if /(«) * 1/(1 — 2 ), then the expansion of f(z) about 2 *■ 0 yields 

/(«) » 1 

The circle of convergence of this series is I*] *» 1. But /(z) « 1/(1 — 2 ) is analytic at 
f (f^)i (Fig. 15), which lies outside the circle | 2 ( * 1. If we take a ■« t, the formula 
(10-3) yields the series whose circle of convergence is [2 — t| « \/5, and this circle in¬ 
cludes the point z « (?^)». The reader may find it instructive to deduce the expansion 
for f(z) ■•1/(1 — 2 ) in powers of 2 — i and determine the radius of convergence with 
the aid of the ratio test. 



Taylor-series representation of/( 2 ). 


PROBLEMS 

1. Expand/(z) •• 1/(1 - 2 ) in Taylor's series about (a) 2 - 0, (6) 2 •» -1, (c) 2 t, 
ind draw the circles of convergence for each of the series. What relation do the radii 
of convergence of these series bear to the distance from the point 2 «« 1 to the pohit 
about which the series exoansion is obtained? 



m 
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%, Expand f{t) log «in tlie Taylor serios id3out» -• 1, and determine the radiue of 
oottvergenoe. 

S. Obtain the Taylor expansion about a » 0 for the following functions, and deter¬ 
mine the radii of convergence of the resulting series: (a) a*, (6) aina, (c) cos*, 
(d) log (1 4- *), («) cosh *. 

4 . Expand /(*) « sitih * in Taylor’s series about the point * «• nd, and determine the 
radius of convergence of the resulting series. 

5. Discuss the validity of the expansion (1 4* *)"* • 1 + w* + [m{m — 1)/2I]** 4-' * • 
for arbitrary values of m. 

6. Verify the expansions: 

(») 4 - £ + 1)" for|*4-l|<l. 

* n-O 

(b) a* *■ e X-J— 1*1 < **' 

n»o n! 

11. Laurent's Expansion. We have just sliown that a function /(*) 
which is analytic at a given point a can be represented in the neighbor¬ 
hood of that point in a power series. Moreover, this series represents 
/(*) in the interior of the circular region centered at a and whose radius is 
equal to the distance of a from the nearest singular point /(z). In this 
section we prove a more general theorem due to Laurent. 

Laurent’s Theorem. A function /(*) analytic in the interior and m the 
boundary of the circular ring determined by \z -- a\ — Ri and |z — a| «= J?^, 
with R 2 < Ri (Fig. 16), can be represented at every interior point of the ring 
in the form 

fiz) = o „(2 - a)” + 2 7 (IM) 

n-O n-l (z - a) 

where ~ n « 0, 1, 2, ,.(11-2) 



Fig. 16 
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2fnJCt{i - o)-"+‘ 


dt, 


n- 1,2, ..., 


(11-3) 


Cl and C 2 being the boundaries of the ring. 

To prove the theorem we recall that Cauchy’s formula (8-8), when 
applied to the circular ring, enables us to write 


m 


1 



f — 2 2iri c-J f — 2 


(11-4) 


where z is any point in the interior of the ring. 

We show next that the integrals in the right-hand member of (11-4) can 
be represented by the series appearing in (11-1). We begin with the 
integral over Ci and note that if f is on Ci and z is in the ring, then 


1 ^JL_1_ 1 ^ (g -- 

f -2 f-al-( 2 - a)/(i‘ - a) f ~ a (f - o)” 

since^ \z — a|/|f — aj < 1. Thns, 

1 y±ZJ^ 

, , f - 2 “ ."i (f - a)’’+' 

and hence 

Tc, f _ 2 2« J -i „»0 (f - a)”+‘ 


(11-5) 


Since integration of the series term by term can be justified as in the dis¬ 
cussion of (10-1), we can write 


_i_ / /(r) dr 

- 2 




Tc, (r - 


m 


2rci‘ 


c. (f - a) 


-t+i 




00 


X) an(« - a)", 

n»«0 


where we define On by the formula (11-2). This establishes the equality of 
the first terms in the right-hand members of (11-1) and (11-4). 

We consider next the second integral in (11-4). If f is on C 2 , then 


1 


1 


f — 2 2 — a 1 — (f — o )/(2 — a) 

since [f — a |/|2 — a| < 1 in this case. Hence 


- £ 


(f - o)" 

- ’ 


(^ o)' 


u 


m 




1 


00 

4 £ 


/(f)(f - o)« 


‘Note 


2« 7c, f _ 2 ' 2inJ c, (2 - o)"+‘ 
1«1< !• 


4 
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and the integraticm cf the aeries term by term now yields 


1 




a-n 


where we set 


_ y; 

ntl (Z - ar 


a-n = 


m 


-i 

2in Jc, ({• 


dt, n » 1, 2, .... 


This establishes the equality of the second terms in the right-hand members 
of (11-1) and (11-4), and the theorem is proved. 

We note that if f(z) is also analytic in the interior of the circle C 2 , then 
the integrand in (ll-t3) is an analytic function and hence o_n = 0 by 
Cauchy^s integral theorem. In this case (11-1) reduces to the Taylor 
series, since 

/(r) ^ 

(f — n! 

by (8-10). 

We can write the series (11-1) more compactly as 


On = T—<35 


/(«) = X) a „(2 - a)", 


( 11 - 6 ) 


where the On can be computed from the formula 

and r is any simple closed path ^ which lies in the ring and encloses C 2 . 
It is possible to prove that the representation of f{z) in a given circular 
ring in the series (11-6) is unique.* Hence if one obtains for f{z) a repre¬ 
sentation 

/( 2 )“ t.bn{z-ar 


in a certain ring with the center at a, the coefficients bn in this representa¬ 
tion must be identical with those given by formula (11-7). This frequently 
enables one to deduce the Laurent series without evaluating the integrals 
(11-7). 

^Recall that the integrals (11-2) and (11-3) have the same values when calculated 
over any path r into which Ct and C 2 may be deformed without leaving the ring. 

*See, for example, E. C. Titchmarsh, ‘The Theory of Functions,^* p. lOl, 2d ed., 
Oxford University Press, London, 1939. 
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W<» etmmpk, *• ftnd ki it be required to obtain the ex|)«nsioii 



Since e* ■■ 1 4* * + (^/2D -f * • * -f («*/n!) «f *»•, we have for any g s^O 






+ ••• + 



Thift 18 a Laurent expansion about the origin; hence it is the Laurent expansion about 
tbe origin. 

The Laurent expansion for valid for all |*t > 0, can be obtained from the series 

*« 1 -f w -f (m*/ 21) H-by letting u • 1/x. 

As another illustration, consider 


/W 


s 


( 11 - 8 ) 


This function has two singular points: x « 1 and x « 3. To obtain the Laurent series 

a» 

52 ®n(* 1)** valid in the neighborhood of x «• 1, we can proceed as follows. Set 

^x) m x/(x — 3), and expand «^(x) in Taylor's series about x « 1. The result is 

X 


3 


2 .tl 2»+‘ 


(11-9) 


Since x «* 3 is a angular point of <^ 2)1 we conclude that (11-9) converges as long as 
|x — 11 < 2. On multiplying this series by l/(x - 1), we get 


(X - l)(x - 3) " 2(X ~ 1) n^l ' 


which is valid for 0 < |x — 11 < 2. 

To obtain the expansion of/(x) in (11-8) about x « 3, we set ^(x) «■ x/(x ~ 1), expand 
it in Taylor's series about x - 3, and multiply the result by l/(x — 3). 

The expansion for /(x) in (11-8) valid for |x| > 3 can be deduced as follows: We de¬ 
compose/(<) into partial fractions and find 


(x - l)(x ~3)“'x~l“^x-3‘ 


( 11 - 10 ) 


But 

1 

1 1 

-i| 



for |xl > 1 

x-1 * 

* * 1 - (1/* ) 


and 

1 

1 1 

-i| 



1 for|x|>3. 

X — 3 

‘ * 1 - (3/*) 



Substitution of these series in (11-10) pelds the desired expansion. 

The reader may find it instructive to obtain the same expansion by vniting 

“ (f - 1)(» - 3) “ 11 - (1/.) r - (3/*) 

and formktg the {product of the appropriate series for the factors in the lii^t-hand 
member of (11-11). 
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1. Obtain Laurent’s esEpansions for /(s) ^ f)*]: (a) about f •« 0, (ft) about 

« « L 

8. Obtain Laurent’s expansion for valid for |s| > 0. 

8 . Expand in Laurent’s series about s ■« 1: (o) (s — 1)*, (ft) l/(s -« 1)*, (c) (* — 1)* -|- 
U/(* - 1)*1. 

4 . Obtain Laurent’s expansion for f{z) » !/{(* — 1)(* — 2)1 valid in the following 
regions: (o) 1* — 1 1 < 1, (ft) t*| > 2, (c) 1 < |s| < 2. Note that in (ft) and (c) the de- 

M 

sired expansions have the forms ^ OnS**. Hint: Show that 


/(*) ■» 




and 


1 ^ 1 y i 

* - 1 


for |sl > 1, 




z-2 2„‘rJV2; 

t. Show that/(*) - 1/{*’(1 


~~ s)] has the following expansions: 


(o) 53 0 < 1*1 < 1, 

n «->0 

(6) s 1*1 > 

n —0 * 


for 1*1 > 2. 


12. Singular Points. Residues. If x = a is a singular point of an 
analytic function f(z) and the neighborhood of * = a contains no other 
singular points of f(z), the singularity at z = a is said to be isolated. 

Thus, /(*) == l/z has an isolated singular point r = 0 because the region 
I r I = p > 0 contains no singular points other than r « 0 within it. The 
function 


m 


* — 1 


has three isolated singular points: * « 0, 2 « i, * » —i. The function 

f(z) « 


has two isolated singular points: 2 « 1 and z = --I, Not all singular 
points of analytic functions are isolated, however. For example, 




1 

sin (I/ 2 ) 


( 12 - 1 ) 


has a singularity whenever 2 « db(l/A;ir), A; « 1, 2, — These singular 
points are isolated. But (12-1) also has a singular point 2 » 0, which is 
not isolated, for, no matter how small the radius p of the circle 1 2 I « p may 
be, this circle contains infinitely many singular points 2 >» =fc(l/fcir) in its 
interior. 
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The fimetiiHi log has ft singularity at s « 0, and so does Vs, These 
singularities are not isolated because eveiy circle \z\^ p includes part 
of the positive real axis, upon crossing which the single-valued branches of 
log z and V* suffer discontinuities if the real axis is chosen to be the cut, 
as in Secs. 16 and 17. The points at which the branches of a multiple¬ 
valued function assume equal values are called the branch points^ For the 
present we shall restrict our considerations to single-valued functions. 

If s « a is an isolated singular point of /(s), then in the neighborhood 
of s a the function f{z) can be represented by the Laurent series 


m - E - a)» + E -r^~- (12-2) 

n«<0 n-l ^ 

Some coefficients in (12-2) may vanish, and there are two nontrivial cases 
that present themselves: 

1. The expansion (12-2) contains at most a finite number m of terms 
with negative powers of « — a, so that (12-2) reads 


® ft-i 0-2 

/(*) - E a»(* - «)’ +-+ 7- Zi 

•-0 z-a (z- ay 


f • * • + 


(z - a)** 


( 12 ^) 


2. The expansion (12-2) contains infinitely many terms with negative 
powers of « — a. 

The type of singularity at ^ * o characterised by the representation 
(12-3) is called a pole of order m. A pole of order 1 is also called a simple 
pole. When the expansion (12-2) has infinitely many terms with negative 
powers of 2 — o, the point z « o is called an essential singular point of f{z). 
We shall see in Sec. 14 that the behavior of a function in the neighborhood 
of a pole differs radically from that at an essential singular point. 

We note from (12-3) that whenever f{z) has a pole of order m, one can 
define a function ^ 

0(z) * (2 - a)’7(z), Z9^a, 

^(a) « a-m, 


which is analytic at 2 * a, but the function (2 — a)’”~V( 2 ) is not analytic 
at 2 « a. Tto property is used sometimes to define a pole of order m. 

The coefficient a-i in the Laurent representation (12-2) of f(z) in the 
neighborhood of an isolated singular point 2 » a plays an important role 
in the evaluation of integrals of analytic functions. This coefficient is 
called the residue of f{z) at 2 » a. 

When the singularity at 2 « o is a pole of order w, the residue at a can 

^ Whsn 2 •• a, the function 4{z) aasumes the indeterminate form 0/0. We agree to 
define #(«) «• lim ^ 2 ). 
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be detennioed without deducing the Laurent expansioD. Thus, on nkulti- 
plsdng (12-3) by (z — o)", we get 

*(*) * (2 - o)"y(z) 

- o_„ + o_,+i(2 - o) -i-h 0_,(* - o)"*“^ 4- ao(2 - o)" -I- 


(124) 


where o_« 9^ 0. Since this is a power-series representation of ^( 2 ), the 
coefficient o_i in it must be the coefficient of the term (2 — o)"*~* in the 
Taylor expansion of 4>(z) about 2 = 0 . Thus 


1 d"-M(2 - o)’7(2)l 

(in - 1)1 


(12-5) 


We formulate this result as a useful theorem: 

THEoamif. If <t>(z) = (2 — a)’^f(z) is analytic at z a and <tia) 5 ^ 0, 
then f(z) has a pole of order malz = a vrith tiie residue given by (12-5). 

As a special case of this theorem we note that when the pole at 2 « a 
is simple, the residue at a is given by the formula 


o_i = lim f(z){z — a). 


( 12 - 6 ) 


Example 1. Obtain the residues at the singular points of /(*) — (1 -b *)/[*(2 — *)]. 
This function has a simple pole at x — 0 inasmuch as 


^(x) 


1 -f- * 

*«(2 — t ) 


1 

2—2 


is analytic and does not vanish at 2 0. Also 


^( 2 ) 


(2 - 


2 ) 


1 4-* 
2(2 - 2 ) 


g-fl 

2 


is analytic at 2 » 2 and does not vanish for 2*2. Hence f{z) also has a simple pole 
at 2 » 2. 

The residues at these points can therefore be computed with the aid of the formula 
(12-6). We find that the residue at 2 * 0 is H *»d at 2 * 2 it is 
Example 2, The function 

/W " x« + l “ {X + i){t - t) 


obviously has simple poles at 2 

0-1 lim (2 — i) 


—i and 2 * ». Therefore the residue at 2 * i is 
«* c* 

- - -—.. ms UlIX . . . - ■■ -- 

(2 -f i)(z — t) I 2 -f * 2t 


Similarly, the residue at 2 * —» is found to be —«~'V2«. 

Example 3, The function /(s) * l/( 2(2 4 1)*] has a simple pole at 2 * 0, since 


^(*) 


1 1 
‘x(x + D* “ (X -h D* 
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is sasHiirtk! st s •> 0 and *« 0. IWefors, tlis naulae at * 0 is 


(OBAT. 7 


Tbs idngularity of /(a) at a 


a-i •• lim 


; - 1. 


4-1)* 

— 1 k & pole of order 2, stnoe 


Ml) - (* -h 1)* 


k Anolyttc at « -» —1 and ^-1) — —1. 
a *•» -♦I with the aid of (12-5). We get 


a-i 


lid* 


1 ^ 1 
a(l 4> *)* z 

We can therefore compute 



the residue at 


BxampU 4. The function (sin r)/s* has a pole of order 3 at a • 0 as the reader can 
easily c^k with the aid of the theorem of this section. Hence the residue at s «■ 0 
can be computed by using formula (12-5). It is simpler, however, in this case, to write 
out the lAurent expansion in the neighborhood of s - 0 and obtain the residue from it. 

Since sin s « s — (s*/3I) 4“ («*/60- 1 


""s* 8!s 5! 


for 1*1 > 0. 


It is dear from this that the singularity at s • 0 is a pole of order 3 with the residue 


-1/31. 

Example 5. The function 


has an isolated singular point at s «• 
that 


COBU 


1, This point, however, is not a pole, for on noting 




we Gonoittde by the substitution u •- 1/(2 — 1) that for I 2 — 11 > 0, 

1 , i . 1_ 

^ 2 — 1 * 21(2 - 1)* 41(2 - 1)* 

This is the desired Laurent expandon about f 1. Binoe it has infinitely many negative 
powers of 2 — 1, the point 2 «• 1 is an essential singular point. Inasmuch as the term 
(2 1)~^ does not appear in the expansion, the residue a«i at 2 » 1 is sero. 


PROBLEMS 


h Obtain the Laurent expansions in the neighborhood of the singular points of the 
following functions, and thus obtain the residues: 


(«) . W e-W, (c) , (<J) ^, («) ^, (/) AV. to) 

(0 2 ' i) ' *' ® 


1 - A 
** 


,W 


«• 

(*-!)*’ 
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%, Whenever posfiible, determine the reeadues et the polee of the fuoetioiu) In 1^. I 
by meaofl of formula (12-5). 

$. Obtain tbe rewdues in Examj^dae I, % and 3 of ibis neotiicm by deducing appropriate 
Laurent’s series. 

4u Prove the following theorem: If f(z) » g(z)/h(z) is the quotient of two functions 
analytic at s « o such that g(a) 0» fe(a) «• 0, and fc'(o) 0, then /(*) has a 8inq>le 
pole at * • a with the reridue p(a)//i'(o)\ Ei-ni: Examine the quotient of the Tayte 
expansions of g(z) and h(t) about s •• a. 

5, Use the theorem of Prob, 4 to show that /(*) » cot f cos s/sin i has simple 
poles at s •• sfcicir, it 0, 1, 2, .... 

6. Note that/(*) -» 1/(2 — s) -f l/(* — 1) has the Laurent expansion 

«e 1 «o 1 

/(*) - Z 5^*" + E IT 

^ 2*+* iZi *" 

valid in the ring 1 <|i| < 2. This expansion has the term 1/s. Does it follow that 
s >i 0 is a singular point of /(s) with the residue equal to 1? 

13. Residue Theorem. Let f(z) be analytic in the given closed region 
R bounded by C, except at the isolated singular points s = 2 i, s = zj, 

..., 2 = 2m- If these points zt are enclosed by circles T* (A: = 1,2. m), 

so that f(z) is analytic in the multiply connected region bounded by C and 
the r*, we know that 

^/(s) dz =« ^j.j/(2) dz + dz-\ -1- dz. (13-1) 

But from (11-7), on setting n - —1, we see that 

S{z)dz (13-2) 

2irt J r* 

where (a_i)it is the residue of f{z) at 2 = 2 *. We can thus write (13-1) 
in the form 

m 

(fj{2) cte « 2iri X) (a-.i)*. (13-*3) 

km.1 


The result embodied in this formula is known as the Residue Theoeem: 
The integral of f{z) over a contour C containing within it only isolated singular 
points of fiz) is equal to 2ri times the sum of the residues at these points. 

Inasmuch as the residues of f{z), as demonstrated in the preceding section, 
can often be easily calculated, we see that formula (13-3) provides a simple 
means for evaluating integrals of analytic functions with isolated singu¬ 
larities. 


Examide 1. 


Evaluate 


f 

Jc *{2 — t) 


df, 


where C is the circle 


1 . 


The only singular point of the integrand enclosed by C is s «• 0. In Examine 1 of 
Sec. 12 we saw that the residue of the integrand at« «>« 0 is Hence the value of the 
integral is (2iri)H • irt* The value of this integral over any path C enclosing * •» 0 
and f 2 is 2tHH - -2irt, sinoe the reridues at these points are H and — 
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J r 

' —— dz orer the dreukr path 1«| •> 2. 

4-1 

The reddues of the integrand at e «• t and i -i were computed in Example 2, 
See. 12. Henoe the value of the integral is 

-airisinl. 

\2i 2t / 

JSmmjde 8. Evaluate j cos 

We saw in Example 5 of Sec. 12 that s » 1 is an essential singular point with the 
residue aero. Hence the value of the integral is aero for every closed path C which 
does not pass through « - 1. If e - 1 lies on C, the integral is improper and other 
means have to be employed to determine its value. 


PROBUMS 


1, Use results of Prob. 1, Sec. 12, to obtain values of the following integrals where C 
is the circle \e\ * 2: 

« ™ Lfh -« «/c^-' «> 

f — 2 

8 . Determine the residues of /{*) ■■ --— at « » 0 and s ■■ 1, and thus evaluate 

z\t — 1) 

J f « — 2 

—-- dz, where C is the circle |s| « 2. 

c *(* - 1) 

f * + 1 

3. Evaluate the integrals / ^ dz (» — 1, 2), where Ci is the circle |s| ■* 1 and 

JCi * “* 2r 


C» is the circle |s| «■ 3. 

4. Find the value of / 

JC 

cirele Isi • 3. 


g-fl 

'c(* -2)» 


dz, where (o) C is the circle lr| ■■ 1, (6) C is the 


14* Behavior of fit) at Poles and Essential Singular Points. From 
Laurent’s representation (12-3) of f{z) in the neighborhood of a pole « =« a, 
we easily conclude that |/( 2 ) | becomes infinite as 2 —> a. The behavior 
of \f(z)\ with an essential singularity at 2 « a is different because the 
expansion (12-2) has infinitely many terms with negative powers of 2 — a. 
While it is true that in this case \f{z) ( as 2 a is also unbounded, the 
function 1 /( 2 ) [ oscillates as 2 —♦ a. Indeed, it was shown by E. Picard 
that in the neighborhood of an essential singular point, /(z) assumes any 
preassigned value, with the possible exception of one value, infinitely many 
times. A discussion of this would ca^ us too far in the study of analytic 
functions, and we merely illustrate this behavior by an example. Since 


f 


l/M 


1 

1+- + 

2 



1^1 > 0, 
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/(«) *» has an essential singular point at 2 ? « 0. We show that if A 
is any complex number not imro, there are infinitely many values of z 
in the neighborhood of e »» 0 such that 

« A, (14-1) 

for on taking the logarithm of (14-1) we get infinitely many solutions 


Log \A \ + i{4> 4“ 2kir) 
where ^ is the principal argument of A, 


0, =bl, ±2,..., 


GBOMSTRIC ASPECTS 


15. Geometric Representation* The usefulness of graphical representsr 
tion of real-valued functional relationships in the familiar three-dimensional 
space is too obvious to require emphasis. The customary mode of rep¬ 
resenting real functions by curves and surfaces fails, however, when one 
encounters functions of more than two independent variables. Thus, a 
relationsliip u = /(x,y, 2 j) containing three independent real variables z 
requires a four-dimensional space for geometric representation. Similar 
difficulties arise when one attempts to represent graphically complex 
functions w - fiz), with z ^ x + iy. For, to each pair of values 
there correspond two values (Ujv) in ii? = w + iv, and in order to plot a 
quadruplet of real values (w,v,a:,y) we need a four-dimensional space. 

However, a different mode of visualizing the relationship w « f{z) which 
utilizes two separate complex planes for the representation of z and w is 
possible. The relationship w = f{z) then establishes a connection between 
the points of a given region R in the z plane and another region Jf2' de¬ 
termined by «; = f{z) in the w plane. 

On separating w — f{z) into real and imaginary parts one obtains two 
real functions 


u « m ( x , v ), 
V ^ vix,y), 


(IW) 


which can be viewed as the equations of a transformation that maps a 
specified set of points in the xy plane into another set of points (w,v) in 
the uv plane. 

We turn now to this mode of studying complex functions. 

Example 1. Let wj « -f a, where o « ^ is a complex constant. 

We set ty «• 4* « *• aj + 

ti -f ■■ a: + ty H- ^ H- 

« Or -f- W -f % H- k). 
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(16-2) 


n «• «-f A, 

» » y + *. 

Formias (15-2) are the familiar equatioiis defining a translaHm, and the relationehip 
0 M « a oan be visualised as representing a rigid dieqplaoement of points in the z 
plane, where each point is moved h units in the direction of the x axis and k units in the 
direction of the y axis. 

EmmpU % To study the function to «> os, where a is a constant, it is convenient to 
use polar coordinates. 

We set s « to * pc^, a • At^ and get 

Hence p • ^ « a +». (IM) 

We see from (15-3) that the modulus of to is got by multiplying the modulus of s by il. 
Also the argument of to is got by adding a constant angle a to the argument B of z. 
We can visualise the transformation (15-3) as representing a stretching in the ratio A: 1 
accompanied by a rotation through an angle a. A square with the center at the origin 
in the z plane is thus deformed into a square, a circle of radius R is transformed into a 
circle of radius AR^ and more generally any figure is transformed into a similar figure 
fflalarged by the factor A. If 4 ■■ 1, we have a pure rotation through an angle a. 

The same conclusions can be reached (but less readily) by setting u> « u -b ir, s * 
X 4* ty, A di -f and by deducing from is « os the transformation 

« « oix - 02y, 

» - 09X -b oiy, 

in cartesian coordinates. 

EzMmpU 3. To study the relationship tp l/s, z ^ 0, we again use polar coordinates. 
On setting w - p«^, z - r«^, we get « (l/r) 0 “", so that 

1 

P - -B, (15-4) 

It is clear from (15-4) that the unit circle \z\"" 1 is transformed into the unit circle 
ltp| «» 1 in the w plane. Since ^ ■■ —the corresponding points on these circles are got 
by reflection in the axis of reals (Fig. 17). As the point A traces out the circle |z| «■ 1 in 
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the cl<»elc«^ direction, tbe cMVlhra^Kmding point ii' in the txx i^ne traces out the drde 
|t(r| >«• 1 in the coiuiterclodcwiee direction. Points in the interior oi |s| 1 are miipped 

into points in the exterior of |w|1, except that the transformation of the point 
z » 0 is not defined by u> « l/s. Points in the neighborhood of s « 0 map into pmnts 
at a great distance from the origin of the to plane, since p •» 1/r. To complete the 
correspondence of points, we can introduce a new point lo «■ «> as the correspondent of 
s 0. The point to » oo is called the poitU of infirvity. If we consider the inverse trans* 
formation z •« 1/to, we see that to «■ 0 corresponds to s ■» «. 

The reader can show that the equations of transformation defined by to m 1/f in 
cartesian coordinates have the form 


with the inverse 


X 

y 


x» + „i 

u 

V 

u«+o*’ 

^ “ It’ + »* 


(IM) 


PROBLEMS 


1 . Discuss the transformations defined by (a) to » (1 + 1 )«, (h) to « l/(a: — 1), 
(c) to « t/z, (d) 10 *« oz 4- 6. 

2. Show that every circle in the z plane maps by the transformation to ** 1 /z into a 
circle in the w plane if one considers straight lines as the limiting cases of circles. Hint: 
Write the general equation of the circle in cartesian coordinates, and make use of (15-5). 

3. Show that the bilinear tramformalion 

oz 4“ 6 

^ ^od — 5c 5^ 0, 

cz 4- d 

can be decomposed into successive transformations z' « cz 4- d, z" «• 1/z', to (a/c) 
•4 l(6c — ad)/c]z^\ which are the type studied in Examples 1, 2, and 3. Then oondude 
(see Prob. 2) that a bilinear transformation transforms circles into circles. Discuss the 
case when od — 6c « 0. 


16. Functions w - and z 
determined by the function 


« -V^- us study next the mapping 
w = (16-1) 


If we set « =* re^ and w =» pe^, we get 


so that 





^ ase 20, 


(16-2) 


It is clear from (16-2) that the upper half of the z plane maps into the whole 
w plane, for when z is in the upper half plane, the range of variation of 9 
is 0 < 9 < T. Since ^ = 2fl, we see that the arguments of the corresponding 
points in the to plane vaiy from 0 to 2ir. Points on the upper half of the 
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cirde \z \« r map into the entire circle 1^1 *» (Fig. 18). The half 
ray OA in the z plane maps into the half ray O'-A' in the w plane. A 
radial line OB, making an an^e B with the x axis, goes over into a radial 
line O'B', making an angle ^ =»= 2^ with the u axis. The interior of the 
quadrant OAC of the circle 1 2 j » 1 maps into the interior of the semicircle 
|ii>|« 1 in the upper half of the w plwe with the boundary ABC going 
over into the boundary A*B*C\ The segment OF of the negative real 
axis in the z plane maps into the segment O'F* along the positive u axis. 
To distinguish points on the positive u axis that correspond to points on 
the ray OA from those on OF, we can imagine that the w plane is slit 



Fig. 18 


along the positive u axis and suppose that the points corresponding to OA 
lie on the upper bank of the slit O'A' and that those corresponding to OF 
lie on the lower bank O'F'. 

The transformation of points determined by (16-1) can be visualized as 
a fanwise stretching of the upper half of the z plane in which the sector 
OAB opens into a sector O'A'B' and the half circle OACF is deformed into 
the whole circle ji^l = 1. The semicircles of radius r in the z plane go 
over into full circles of radius p == in the w plane. Points in the lower 
half of the circle \z\^ I map into the whole circle \w\— 1, inasmuch as 
the replacement of 8 by 0 -f ir in (16-2) yields - 26 + 2v, Thus, two 
distinct points B and (7 with the arguments 0 and 0 + x in the z plane 
correspond to one and the same point B' in the w plane. 

This is to be expected, since, on solving (16-1) for 2 , we get 

z = Vw, (16-3) 

which is a double-valued function. If we set u) = in (16-3), we get 
two values 

z - z = Vp c**^*'*^"^' = (16-4) 

For points along the u axis, the argument ^ = 0. Points on the upper 
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bank of the slit O'A' in Fig. 18 correspond to « Vp, and those of the 
lower bank O'F' to » — Vp« Thus, along the slit, z »* Vta is a dis¬ 
continuous function unless p =« 0 . 

The function 

w » n a positive integer, (16-5) 

can be studied in the same way. On setting z « re^, w « pe^ we find 


p = <f> == n$. 


(16-6) 


This time a wedge of angle 2r/n in the z plane (Fig. 19) maps into the 



whole of the w plane, and a circular arc ACB of radius R goes over into 
a full circle |u^| = 1?"^. An adjacent wedge OBD of angle 27r/n also maps 
into the whole w plane. If we divide the z plane into a set of n adjoining 
wedges, each of angle 2 ir/n, the entire z plane will be mapped into the w 
plane n times. 

Corresponding to a given point 5 ^ 0, there will be n values of z de¬ 
termined by the n roots 


z 




r/ * 2rk\ / 

<t> 2irk\" 

1 cos - + 

— ) + T sm 1 

- + - ) 

L \ n 

n / \ 

n n / J 


(16-7) 


with A;==0, 1, ...,n — 1. Each of these roots lies in one of the wedges 
into which the z plane is divided. 

Some further insight into the character of mapping by means of (16-1) 
can be gained by studying the maps of lines u = const, v == const. If 
we set 2 X + ty in (16-1), we find 


V =» 2xy, 


(16-8) 


so that the lines u » const, v «= const map into orthogonal h 3 rperbola 8 
^ const, 2xy « const. Some of these are shown in Fig. 20, in 
which the corresponding points are labeled by like letters. 




Fig. 20 
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17. Hie Fundioas w «> e* aad 2 ■■ log to. If we set 

w «» M + w sad 

z ^ X + iy in 

w «» 

(17-1) 

we get 

u + w ^ 6*’^**' « e*(co8 y + t sin y). 


Hence 

w » e* cos y, t; « e* sin y. 

(17-2) 


It follows from these equations that 


^ 

V 

- *= tan|^. 
u 


(17-3) 


Accordingly, the lines x =* const map into the circles « const in 

the w plane, and the lines y = const map into the radial lines v/u — const. 
Since 

^ ^M^2kri « 0, =bl, db2, ..(17-4) 


we see that w ^ has an imaginary period ^ 2irt. Hence, if the z plane is 
divided into horizontal strips of width 2ir, with the initial strip determined 
by 0 < 2 / < 2ir (Fig. 21), the relations (17*4) ensure that the behavior 



of w ^ e* in every strip 2kir < y < 2{k + 1)t, k »= dtl, db2, ..., is 
identical with that in the initial strip. Consequently, we can coniine our 
attention to the behavior of lo — e* in the initial strip 0 ^ < 2ir. 

A segment AC of a straight line x «= x© in the initial strip maps by (17-3) 
into a circle ^ The points A(xo^O), C(xo,2ir) correspond to 

the same point u » t; » 0 on the u axis. The segment OP of the 

* A« for real fuactions, we «ay that f(z) is periodic of period a if /(« + o) ■■ /(*), 
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y axis tmip& into the unit circle + p* » 1, since along OP x *» 0; the 
half strip x > 0, 0 < < 2ir, maps into the region >1. If x < 0, 

a segment such as QP in Fig. 21 maps into a circle whose radius is less than 
1. The half strip x < 0, 0 < y < 2ir, goes into the interior of the circle 
|u?| » 1. Points on the lines y « 0, y = 2 ir, forming the boundaries of the 
strip, map into points on the positive u axis. If we slit the w plane along 
the positive u axis, then the points on the upper bank of the slit correspond 
to points on the line y » 0 and those on the lower bank to points on y « 2 ir. 
The interior of the rectangle OACP in Fig. 21 corresponds to the interior 
of the ring between the circles w® + = 1 and 

We further note that a point moving along the x axis away from the 
origin 0 in the positive direction has for its image a point in the w plane 
that moves in the positive direction along the u axis away from the image 
O' on the unit circle. A point moving avmy from 0 in the direction of the 
negative x axis has for its image a point moving from O' toward the origin 
of the w plane. 

If we consider some definite point wq in the w plane, the equation 

“ e* (17-5) 

has for its solution 

z « logu^o =* Ix)g|t4?o| + + 2A;ir), « 0, drl, db2, ..., (17-6) 

where <l>o is the principal argument of wq. All these values of z differ only 
by the imaginary part, and therefore there is just one solution of (17-5) 
in each strip 2kv < y < 2{k •+■ 1 )%. The function 

z ^ log w 

is therefore infinitely-many-valued. If we restrict our attention to the 
slit w plane so that the argument 4) o(w lies between 0 and 27r, the mapping 
from the w plane to the z plane will be single-valued with just one image of 
log w in the fundamental strip 0 < 2 / < 27r of the 2 plane. 

To study the map oi w ^ log z we interchange the roles of the z and 
w planes in the foregoing discussion. We remark in conclusion that inas¬ 
much as all trigonometric functions of z are defined in terms of e*, a study 
of the mapping properties of such functions is reducible to the study of 
mapping by to « c®*. 

PROBLEMS 

1. Discuas in detail mapping by the function ti? « 2 *. 

2* Show that the function 



fnaps the circles { 2 I » const into confocal ellipses and the radial lines arg 2 9 » const 

into confocal hyperbolas. 
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8 . Prove that mn sf and tan z are periodic Cunctione. 

4. Show that the curvee uix^y) * const, t>0r,v) const in (16-8) intersect at right 
angles (Pig. 20). 


18. Conformal Maps. We noted in Sec. 15 that the relationship w « 
f(z) can be viewed as a mapping that sets up a correspondence between the 
points of the z and w planes. If ic = f(z) is analytic in some region R 
of the z plane, and if C is a curve in R, there is a remarkable connection 
between C and its image C' in the corresponding region R' in the w plane 
(Fig. 22). Consider a pair of points z and z + Az on C, and let the arc 



length between them be As = PQ. The corresponding points in the 
region R' are denoted by w and w + Aw, and the arc length between them 
by As' = P'Q\ Since the ratio of the arc lengths has the same limit as 
the ratio of the lengths of the corresponding chords, 


As' I Aw I 

lim — = lim -- 

At0 As A* 0 IA2: I 


lim 

A* —♦ 0 


Aw \ 


dw 

Az 1 


dz 


(18-1) 


We shall exclude from consideration those points of R at which dw/dz = 0 
because at such points the correspondence of values of z and w ceases to 
be one to one.^ 

Formula (18-1) shows that an element of arc through P, on being trans¬ 
formed to the w plane, suffers a change in length such that the magnification 
ratio is equal to the modulus of dw/dz at P, This ratio is the same for all 
curves 'passing through P, but ordinarily it varies from point to point in the 
z plane, since \dw/dz\ need not have the same value at all points of the 
z plane. 

We shall see next that the argument of dw/dz determines the orientation 
of the element of arc As' relative to As. The argument B of Az (Fig, 22) 

^ If dw/dz » f'{z) « 0 at some point P of H, then dz/dw « \/f'(z) is not dehned at the 
corresponding point P' for the inverse function z « F{%t}). Thus F{w) is not analytic at 
P\ Indeed, it can be shown that a necessary and sufficient condition for the existence 
of a unique differentiable solution of w «« f{z) at the jx^int 2 : » 20 is precisely /'(s^) ^ 0. 
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m iht aai^ made by the chord PQ with the positive direction of the a; 
axis, while the argument 0 ' of Aw is the an^e made by the corresponding 
chord P'Q' with the u axis. 

Henoei the difference between the angles 6 ' and 0 is equal to 


arg Aw — arg Az 



since the difference of the arguments of two complex numbers is equal to 
the argument of their quotient. As Az —> 0, the vectors Az and Aw tend 
to coincide with the tangents to C at P and C' at P', respectively, and 
hence arg dw/dz is the angle of rotation of the element of arc As' relative 
to As. It follows immediately from this statement that if Ci and C 2 are 
two curves which intersect at P at an angle t (Fig. 23), then the correspond¬ 



ing curves €[ and C 2 in the w plane also intersect at an angle r, for the 
tangents to these curves are rotated through the same angle. 

A transformation that preserves angles is called conformal^ and thus one 
can state the following theorem: 

Thjbjorem. The mapping performed by an analytic function f(z) is con¬ 
formal at all points of the z plane where f{z) 9 ^ 0. 

The angle-preserving property of the transformation by analytic func¬ 
tions has many important physical applications. We shall indicate several 
of these in the remaining sections of this chapter, and we merely note here 
that a number of results deducible analytically from Sec. 15, Chap. 5, 
follow directly from geometric considerations. 

For example, if an incompressible fluid with a velocity potential ^(a:,y) 
flows over a plane (so that Vx « d^/dx^ Vy » d^/dy), then it is known' 
that the streamlines %(x,y) «= const are directed at right angles to the 
bquipotential curves ^(x,y) ^ const. 

^ flee flee. 15, Ohap^ 5, and particularly Prob. 6 of that seotkm, 
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The orthogonality of the corves # « const and ^ * const in the z plane 
follows at once from the conformal properties of transformations by 
analytic functions. It was shown ^ that the functions ^ and ^ satisfy 
the Catichy-Riemann equations. One can therefore assert that ^ and ^ 
are the real and imaginary parts, respectively, of some analytic function 
w f{z ); that is, 

f(z) « Hx,y) + i^{x,y). 

But the curves $ == const and = const represent a net of orthogonal 
lines (Fig. 24) parallel to the coordinate axes in the w plane, and they are 



transformed by the analytic function w = ^{x^y) 4* i^ix^y) into a net of 
orthogonal curves in the z plane. 

We saw in Sec. 9 that the real and imaginary parts of every analytic 
function f{z) — uix^y) + ivix^y) are harmonic; that is, they satisfy La* 
placets equation in tlie region where f{z) is analytic. Since solutions of 
Laplace's equation are demanded in numerous practical problems, analytic 
functions serve as a useful apparatus for producing such solutions. For 
example, if we take 


It) «= w + IV == sin 2 = sin (a: +• iy) 
then a + IV « sin x cos iy + cos x sin iy 

« sin x cosh y + i cos x sinh y. 

The harmonic functions u *= sin x cosh y, v = cos x sinh y are of special 
interest in deducing solutions of Laplace's equation in rectangular regions.® 
Further importance of conformal transformation by analytic functions 
derives from the fact that a harmonic function remains harmonic when 
subjected to such a transformation. If a function ^(u,v) satisfies Laplace's 


iSee Eq. (15-10), Sec. 16, Chap. 6. 
* See, for example, Sec. 20. 
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d^<tt 
du^ dv^ 


0 


(18-2) 


in some iregion fi' of tlie uv plane, then <!> still satisfies Laplace^s equation, 
in the appropriate region R of the xy plane, when the variables n, v in 
are related to x, y by an analytic function 

w ^ u + iv — f(z). (18-3) 

To see this, construct an analytic function 

F{w) = <#>(u,t>) + iypiUfV) (18-4) 

by calculating the conjugate ^(w,v) of the harmonic function <j^(w,t;). 

The substitution from (18-3) in (18-4) yields 

F[f(z)] = 4>(x,y) + i4^(x,2/), (18-5) 

which is analytic in the region R of the xy plane into which the region 
R' is mapped by (18-3). The function 4>(x,y), being the real part of the 
analytic function F[f{z)], is harmonic. 

This property of the transformation of harmonic functions by means of 
analytic functions is of the utmost practical importance; for, suppose that 
we are required to find a solution of Laplace's equation (18-2) such 

that on the boundary C' of some complicated region R' in the uv plane, 
<f>(UfV) assumes specified values. If it should prove possible to find a func¬ 
tion w = f(z) which maps the region R' conformally into some simple 
region R (a circle, for example) in the z plane, it may be relatively easy to 
determine the transform 4>(x,t/) of *^(u,v) in the region R with proper values 
of on the boundary C. 

If <^(x,2/) is so determined, the function <^(u,v) can be obtained by re¬ 
placing the variables in 4»(x,y) by their values in terms of u and v. It is a 
remarkable fact, first discovered by Riemann, that every simply connected 
region fi' (with more than one boundary point) can be mapped conformally 
onto the unit circle \z\< 1 in such a way that the boundary C' corresponds 
to the circular boundary \z\^ 1. 

We shall sketch this mode of solution of the Dirichlet problem in Sec. 21. 


PROBLEMS 

1 , Obtain solutions of Laplace’s equation from (o) tu cos s, (6) tu e*, (c) to ■■ 

{d) to «»« log e, (e) to «« l/z. 

2. Construct the conjugate harmonic functions v(z,y) for the following functions: 
(ff) a «• cos X cosh y; (5) u « c* cos j/; (c) u » y -1- e* cos y; (d) u » eosh x cos y. 

2. Examine the mapping by w and tu » ^ at « »• 0. Is it conformal at s « 0? 
'ilxamine the behavior of the maps of rays issuing from s «« 0. What are the ratios of 
magnification of the arc elements at is » 1, s «« 1 -f t, x «* «? 
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19. Steady Flow of Ideal Fluids. We discussed the flow of nonviscous 
incompressible fluids in Sec. 15 of Chap. 5, where we introduced the con¬ 
cept of the velocity potential and the stream function 

These functions were shown to be related by tlie Cauchy-Riemann equa¬ 
tions 

a# a0 ^ a^ 

dx dy dy dx 

It follows from (19-1) that 

F(z) « H^,y) + i^ix.y) 


(19-1) 

(19-2) 


is an analytic function of a complex variable z — x + iy. We shall call 
F(z) the complex potential and show that its derivative is related simply to 
the velocity vector v - of the fluid particles. 

By (4-5), 


dF a$ . a^ 
dz dx dx 


(19-3) 


and, since v = V<l>, so that 

d^ d^ d^ 

as -- y as - as-- 

dx dy dx 

we can write (19-3) in the form 

dF 

— = Vx “*• (19-4) 

dz 


We shall see in Sec. 21 that because of the simplicity of the complex- 
variable theory in comparison with the theory of real functions, it is often 
simpler to calculate the complex potential F{z) than it is to determine 
either of the real functions ^{x,y) or ^(x,y). This determination depends 
on certain so-called boundary conditions^ which are now to be described. 
We first recall ^ that since v == V4> is orthogonal to the curves 4‘(x,y) = const 
and these curves are orthogonal to the curves ^{x,y) = const, the vector 
V is tangent to the curves ^(x,y) == const. Hence these curves, called 
streamlines, are the paths of the fluid particles. When a sheet of fluid flows 
past an impenetrable obstacle C (cf. Fig. 25, Sec. 20), the fluid particles 
must flow along the obstacle and hence the boundary C must coincide with 
one of the streamlines. Thus the equation of one of the streamlines, say 

^(x,y) fc, (19-6) 

must coincide with the equation of the boundary C. 

^ See Sec. 3, Chap. 5, and Sec. 18 of this chapter. 
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To detemune S^(a;,2^) we must then seek a solution of Laplace^s equation 




(IM) 


in the region exterior to the obstacle, which is such that on the boundary C 
^ takes on a constant value. 

This suggests an indirect mode of solution of the steady-fluid-flow 
problems. One examines the shapes of curves « const for various 

harmonic functions ^(a:,y), and if a particular curve '^{x^y) « k coincides 
with the boundary C of an obstacle of special technical interest, then the 
function ^{x^y) solves a special problem. 

It follows from these remarks that any streamline ^{x^y) «= const can 
be regarded as a rigid boimdary of some obstacle. 

Instead of determining the stream function ^(a:,y), we can equally well 
determine a harmonic function ^{x^y) which on the boundary C satisfies 
the condition 

— - 0, (19-7) 

dn 


where n is the unit normal to C, for the statement that the obstacle is 
rigid implies that the normal component Vn of v must vanish along C, 
since no particles of fluid can cross ( 7 . But rn *= u • v, and since v =* V<I> 
and 

d^ 

— * O’V# « Vn, 
dn 


we see that (19-7) must hold on C. 

It should be noted that we have assumed in the foregoing that there are 
no sources or sinks in the region and that the fluid is incompressible. 
Moreover, the flow is irrotational, and hence ^{x^y) and '^{x^y) are single¬ 
valued functions. These considerations can be extended to the more 
general situation in which circulation is present. However, as we shall see 
from examples in the following section, the complex potential F{z) will 
then no longer be a single-valued function of z. 


PROBLEM 

Deduce from the boundary condition (19-7) that d^/dn ■> 0 along C, so that ^ «• 
const on C. Hint: Note that d^/dn ■■ (d^/dx)(dx/dn) + {d^/dy){dy/dn). Make use of 
(19-1), and observe that dx/dn «■ dy/d9, dy/dn « —cfcr/ds on C. 

20. The Method of Conjugate Functlona. We observed in the preceding 
Section that every analytic function F{z) « u(x,y) -f iv(x,y) can be 
aesociated with some flow pattern of an incompressible fluid. In fact, 
every such function determines two flow patterns, since either of the 
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harmoaaic fuacticma u(x,y)f viw,y) can be regarded aadetermiB^ the stream¬ 
lines. 

The simplest example of an irrotational flow is furnished by the function 


F(z) « cr 


where c is a real constant. Since z ^ x -h iy^ we have ^ ^ cxy ^ ^ cy^ 
and thus the curves ^ » const are straight lines parallel to the x axis. 
The formula (19-4) for the velocity of the fluid yields Vx « c, Vy « 0, so 
that the flow is parallel to the x axis. Since div v =» 0 and curl v » 0, 
there are no sources or sinks in the region and the flow is irrotational. 

As a more interesting example, consider 




c > 0, > 0. 


If we set 2 » re*® in (20-1), we easily find that 


/ a\ ( a\ . 

sr. c I r + —1 cos Bj ^ ^ c\r -1 sm 6. 


For r « a, we have ^ « 0, and hence the boundary of the circle r « a 
is a streamline. The pattern of streamlines is shovn in Fig. 25 by the 
solid lines, and the curves ^ « const are indicated by the dashed lines. 
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This flow pattern corresponds to a flow around a circular cylinder, 
velocity components are determined from 


nz) 



The 


It is easy to verify that div v = 0 and curl v == 0, so that the flow is 
irrotaticW. The points for which - Vy ^ 0 are z - d=a. These are 
called the niagnatim points. 

Let us investigate next the flow pattern determined by 

F{z) ^ clogz — u iVy z - re^y (20-2) 

where c is a real constant. 

If we consider only the one branch of this multiple-valued function for 
which 0 < 0 < 2ir, we get 

F{z) = c(Log r + iB)y 

j 30 that u — c Log r, v — cBy 0 < 0 < 2t. 

If we set ^ cBy then the streamlines ^ = const are the radial lines 
and the curves »=» const are circles c Log r = const (Fig. 26). By Eq. 



(15-1) of Chap. 5, the amount of the fluid crossing per second any closed 
curve C is 


V =j^(v^dy- 



d'J'. 


But ^ » c0, 80 that 

V ^cf de. 
Jc 
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This integral vanishes for any path that does not enclose the origin. If 
the origin 2 » 0 is within C, then V «« 2vc. Hence for c > 0, the flow is 
outward and we have a source of strength 2irc at the origin. For c < 0, 
we have a sink of the same strength. Thus, div v « 0 at all points except 
2 « 0. 

The circulation J is given by the integral ^ 


J -^liv^dx + vydy) = 

and since ^ - c Log r^J — 0 and the flow is irrotational. 

If, however, we take ^ ^ c$ and ^ ^ c Log r, the roles of the curves 
<l> const and ^ = const in the preceding discussion are interchanged. 
We thus conclude that for this flow the circulation J = 2irc if C encloses 
the origin. This corresponds to the situation described as a paint vortex 
at the origin. 

The reader will find it of interest to study the function 

4> -f tl' = c ^2 H-^ — zc' log 2 , a > 0, c > 0, 


for which ^ = const when \z\^ a. The function represents a 

flow around a circular cylinder r « a with the circulation 2irc'. 

As further examples of functions yielding useful solutions of interesting 
physical problems consider the following: 

1. The Transformation w » coshz. Here 


Thus, 


g* g * 

w «- 

2 


cosh 2 , 


so that 


or 


u + w = cosh (x + ty) ■*= cosh x cosh iy + sinh x sinh iy 
*» cosh X cos y + i sinh x sin y, 
u «= cosh x cos y, 

V ^ sinh x sin y, 


u* 


+ 


cosh^ X sinh* x 


cos* y sin* y 


1 , 

L 


* See Sec. 10, Chap. 5. 
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This traiisfoiinatiioii is dbown in Fig. 27, and it may be used to obtain the 
electrostatic field due to an elliptic cylinder, the electrostatic field due to 
a charged plane from which a strip has been removed, the circulation of 
liquid around an elliptic cylinder, the flow of liquid through a slit in a 
plane, etc. 

The taansfonnation from the z plane to the w plane may be described geo¬ 
metrically as follows: Consider the horiaontal strip of the z plane between 
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the lines y « 0 and y « r, and think of these lines as being broken and 
pivoted at the points where r =» 0. Rotate the strip 90’’ counterclockwise, 
and at the same time fold each of the broken lines y = 0 and y » x back 
on itself, the strip thus being doubly '‘fanned out** so as to cover the 
entire w plane. 

It is interesting to note that this same transformation w = cosh z can 
be used to solve a hydrodynamic problem of a different sort. When liquid 
seeps through a porous soil, it is found that the component in any direction 
of the velocity of the liquid is proportional to the negative pressure gra¬ 
dient in that same direction. Thus, in a problem of two-dimensional 
flow the velocity components (tt,v) are 


u 



V 


dy 


If these values are inserted in the equation of continuity, namely, in the 
equation 

du dv 


the result is 


dz 




d^p 


0. 
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Suppose, then, one considers the problem of the seepage flow under a 
gravity dam which rests on material that permits such seepage. One 
seeks (see Fig. 28) a function p that satisfies lisplace's equation and that 



satisfies certain boundary conditions on the surface of the ground. That 
is, the pressure must be uniform on the surface of the ground upstream 
from the heel of the dam and zero on the surface of tlie ground down¬ 
stream from the toe of the dam. If we choose a system of cartesian co¬ 
ordinates u, V with origin at the mid-point of the base of the dam (Fig. 
28) and u axis on the surface of the ground, then it is easily checked that 
the function p{UfV) =* Poyi^^)/^^^ where 

w ^ u + iv ^ a cosh (x + iy)^ 


satisfies the demands of the problem. In fact, it was seen in the study of 
the transformation w » cosh z that the line y — v of the z plane folds 
up to produce the portion to the left of = —1 of the u axis in the w 
plane and the line y ~ 0 of the z plane folds up to produce the portion to 
the right of u =* +1 of the u axis. The introduction of the factor a in 
the transformation merely makes the width of the base of the dam 2a 
rather than 2. These remarks show that p(u,v) reduces to the constant 
T on the surface of the ground upstream from the heel of the dam. If the 
head above the dam is such as to produce a hydrostatic pressure po) one 
merely has to set 

, , VoVM 

p(u,v) -- 


One can now find the distribution of uplift pressure across the base of the 
dam. In fact, the ba.se of the dam is the representation, in the nv plane, 
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of the lioe x«0, of the xy plane. Hence, on the base of the 

dam the equations 

u — a cosh X cos y. 


reduce to 


a =* a sinh x sin 

u = a cos y, 

v == 0, 


so that p(u,0)=“-co8 

TT a 

This curve is drawn in the figure. The total uplift force (per foot of 
dam) is 


Po r+a _ u 

=r — / cos "du — poa, 
TT •'-« a 


which is what the uplift pressure would be if the entire base of the dam 
were subjected to a head ju.st one-half of the head above tin' dam or if 
the pressure decreased uniformly (linearly) from the statu* head po at the 
heel to the value zero at the toe. The point of application of the resultant 
uplift is easily calculated to be at a distance h = 3a/4 from the heel of the 
dam.^ 

2. The Transformation ic = z 4 c^. One has 
u + iv - X + iy + c* ^ 

= X + fy + c*(co.s y + i sin y), 
so that u = a* + c* (‘os ?/, 

V — y + sm y. 

This transformation is shown in Fig 29 If one considers the portion 
of the z plane between the lines y = drv, then the portion of the strip to 
the right of x = —1 is to be “fanned out’' by rotating the portion of 
y « 4-1 (to the right of x ~ —1) counterclockwise and the portion of 
y sz —1 (to the right of x = —1) clockwise until each line is folded back 
on itself. This transformation gives the electrostatic field at the edge 
of a parallel-plate condenser, the flow of liquid out of a channel into an 
open sea, etc. 

1 Some material in Secs 18 to 20 is taken by permission from a lecture by Dr. Warren 
Weaver printed in the October, 1032, issue of the American Maihematxml Monthly. 
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PROBLEMS 

1. Study the flow determined by the complex potential ly » cs* in a quadrant a; > 0, 
y > 0. The function «« 2cxy can be associated with the flow of fluid around a comer. 

2. Study the flow determined by the complex potential ty « c sin * in the semi- 
infinite region [xj < v/2, y >0. 

21. The Problem of Dirichlet The procedure for reducing solutions oi 
physical problems described in the preceding section is indirect. It depends 
on the examination of various harmonic functions that satisfy the boundary 
conditions appearing in specific physical situations. 

In this section we outline a general procedure for constructing harmonic 
functions which assume preassigned boundary values. Thus, let it be 
required to determine a solution of Laplace^s equation 

V^^{x,y) = 0 ( 21 - 1 ) 

which on the boundary C of a given simply connected region R assumes 
preassigned continuous values 

$ - ( 21 - 2 ) 

The variable s in (21-2) may be thought to be the arc-parameter a measured 
along C from some fixed point. 

The boundary-value problem characterized by Eqs. (21-1) and (21-2) 
is known as the Dirichlet problem, and it can be shown that the solution 
of it exists and is unique whenever the boundary C is sufficiently smooth. 
These conditions are usually met in physical problems. 

We first outline a solution of this problem for the case when the region 
ti is the unit circle \z\< I and later indicate how this solution can be 
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generalized to yield a solution of the Diriehlet problem for an arbitrary 
simply connected region with the aid of conformal mapping. 

Thus, let it be required to construct in the circle j z j < 1 a harmonic 
function such that on its boundary 7 (Fig. 30) 

Hx,y) = fie), ( 21 - 3 ) 

where f{e) is a specified function of the polar angle d. 



Instead of determining it proves more convenient to determine 

an analytic function 

F(zj = l^l < 1 (21-4) 

whose real part takes on prea^signed values (21-3) and then compute 
^{x,y) by separating F{z) into its real and imaginary parts. Now, since * 
F{z) + F{z) =« 2^{x,y)y we can write tlie boundary condition (21-3) in the 
form 

F(n + 2 m (21-5) 

where f « represents the values of 2 « on the boundary 7 . If 

1 dr , 

we now mu\lip\y \>ol\\ memWs oi (2\-5) by-, where z is an 

!2irir - z 

'mtenor point oS the circle, and integrate over y, WO 

»We use bars to denote the conjugate values, so that ^ « H^,y) - 
* I'o prove that the conditions (21-5) and (21-6) are equivalent, one must impose 
certain continuity restnetions on J{$) usually met in the physical problems^ See, for 
example, I. S. Sokolaikoff, ’‘Mathematical Theory of BiaetM^y ” 2dod,. p* l4tS. MoGraw- 
ElU Book Company, Inc., New York, 1956. 
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2ri -'T f — z 2ut •'t f — « TTt -- z 


dt. 


( 21 - 6 ) 


By Cauchy^s integral formula, the first integral in the left-hand member 
)f (21-6) is eq ual to F(z), We show next that the second integral has a 
constant value F(0) as long as | r| < 1. On expanding F({') in Maclaurin’s 
series, we get 

F(z) - F(0) + F'{Q)t + - F'iOy +... + -i/!'(»>(0)2" +■■■ (21-7) 
21 n! 


ivhich is convergent for all |s| < 1, since Fiz) is assumed to be analytic 
in \z\ < 1, 

If we set 2 =» f in (21-7) and form the conjugate /^(f), we get 

2! n! 

But on the circle 7 , f = “ l/«*^ 1/f, so that 

^ + + + (21-8) 
i* 2! r t 

The substitution of this series in the numerator of the second integral 
in (21-6) then yields a series of integrals of the form 


1 1 f F^{0) 

- / - 

n\ 2Ti A (f — 2 )f” 


0 , 1 , 


But the application of the residue theorem shows that these integrals 
vanish for w > 1, and forn « 0 we get 


2lrt •'T f — 2 


F{0) = oo - ibo. 


Thus (21-6) can be written in the form 


F(2) 


1 fJE- 

ri 'y^ — z 


(if — Oo ihoj 


(21-9) 


where oo + ibo = F{0). 

The real part oo of ^(0) can be determined explicitly in terms of the pre¬ 
scribed values /(#) on y, for on, setting 2 = 0 in (21-9), we get 

1 rm 



5dg 

and therefore 
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But f 


Oo * 


2in 



so that df/f « i d$ and hence 


Uq 


1 /•2ir 

- / m de, 

2 t •'0 


( 21 - 10 ) 


Accordingijs the real part of F{z) is determined uniquely when /($) is 
known. The real part of Fiz) is the desired harmonic function 
Since f ~ e"”®, f{0) can be expressed as a function of f, say and we 
see that the integral in (21-9) has the form 



Integrals of this type can frequently be evaluated in closed form with the 
aid of the theory of residues. 

Formula (21-9) thus solves the general Diri(‘hlet problem for the cir¬ 
cular region. 

We indicate next how the Dirichlet problem for an arbitrary simply 
connected region R can be solved when the function 

w « w( 2 ) ( 21 - 11 ) 

mapping the region R in the complex w plane conformally onto the circle 
l^l < 1 is known. Let w — u + iv; then the desinxi harmonic function 
4>(u,e), assuming the prescribed values 

Hu,v) - 0CsO (21-12) 

on the boundary C of ft, is the real part of some analytic function 

JF(ic) ^ + H'(UyV). (21-13) 

On substituting in SF(ic) from (21-11), we get 

^I[^C(^)] ^ F(2), 

which is analytic in the circle 1^1 < 1 . 

The values of the real part of F{z) on the boundary 7 of the unit circle 
are known, since the values of <^(w,l’) on the boundary C are specified by 
(21-12) and the points on C are mai>ped into ix)ints on 7 by (21-11). We 
can thus write the boundary condition ( 21 - 12 ) in the form 

^ = f(d) on 7 . 

The substitution of f{0) in formula (21-9) then yields F(«). To obtain 
the desired function 4>(u,i;), we must calculate the real part of SI(i«), which 
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can be determined from Fiz) by expressing z in terms of w with the aid 
of (21-11). 

It is clear that the solution of the problem of Dirichlet for an arbitrary 
simply connected domain hinges on the construction of a suitable mapping 
function (21-11). The fact that such a function exists is guaranteed by 
Riemann's theorem mentioned in the concluding paragraphs of Sec. 18. 
During the past 30 years considerable attention has been given to the 
problem of developing effective methods for constructing conformal maps 
for simply connected domains.^ A formula for conformal mapping 
of a polygonal region on the unit circle (or alternatively, in the upper 
half of the complex plane) has been supplied * by H. A. Schwarz (1843- 
1921) and E, B. Christoffel (1829-1900). 

During recent years extensive applications of complex variables to 
broad classes of problems in the theory of elasticity have been made.* 


PROBLEMS 


1. Use formula (21-9) to compute harmonic functions ^{x,y) in the circular region 

4- = 1, which assume on its boundary the following values: (o) <t> « 

(6) 4> — X® (c) ^ cos* where 0 is the polar angle. Hint: Note that x « i)^ 

y ( 1 / 20(2 — f) and that on the boundary of the unit circle I ^ Xjz. 

2. Set 2 « e® r(coK </> inin 4>)i T « cos d 4“ i sin 6 in (21-9); take account 

Of (21-10); and show that the real part ^ of Fiz) is 

„ 1 

2r Jo 1 — 2r cos (<? — 0) -f r* 

This formula, giving the values of harmonic function «l> at every interior point (r,<^) of 
the unit circle in terms of the assigned boundary values/(d), is known as Poissmi's inte- 
gral formula, (Cf. Chap. 6, Sec, 12.) liecause of the difficulty of evaluating real inte¬ 
grals, this formula is generally less useful than the Schwarz formula (21-9). 

22. Evaluation of Real Integrals by the Residue Theorem. Formula 
(21-9) and the problems in Sec. 21 suggest the use of contour integration of 
complex functions in the calculation of certain real integrals. 

Thus, consider a real integral 


r2x 

I F(sin cos d) dd 

Jo 


(22-1) 


^ There is a vast literature on this subject, and we cite only a book by L. V. Kantoro¬ 
vich and V. I. Krylov, ‘Approximate Methods of Higher Analysis,” Groningen, 1958, 
containing a comprehensive survey of the problem in chap. 5. A useful catalogue of 
mapping functions is contained in the “Dictionary of Conformal Representation,” 
Dover Press, New York, 1952, compiled by H. Kober. 

* This formula is contained in most books on complex-variable theory. See, for exam¬ 
ple, E. V, Churchill, “Introduction to Complex Variables and Applications,” chap. 10, 
McGraw-Hill Book Company, Inc., New York, 1948. 

* See Sokolnikoff, op. cU. 
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in wliicli jP ifi the quotient of two polynomialjs in And and oo®^. The 
evaluation of such integrals, as we shall presently see, can be reduced to 
the oaloulation of the integral of a rational function of z along the unit 
circle jal» L Since rational functions have no singularities other than 
poles, the residue theorem (13-3) provides a simple means for evaluating 
integrals of the form (22-1). 

We set « a* e^, so that dz » e^i dO 


dz 

or dd «(22-2) 

tz 

and we recall Euler's formulas, 

,j, 2”"^ Z ““ 

COS $ «-» sin 6 -- (22-3) 

2 2i 


On inserting from (22-2) and (22-3) in (22-1) we get the integral 


I K( 2 ) dz 


(22-4) 


in which R{z) is a rational function of z and C is the circular path \z\ « 1. 
If the sum of the residues of R{z) at the poles witliin the circle l^l < 1 
is denoted by Sr, the residue theorem yields R(z)dz == 2irt Sr, so that 

J^*V(8in B, cos S) cte = 2irt2r. (22-5) 


Example L As a specific illustration of this method of cadculsting integrals of the 
type (22-1), consider 


/ 



de 

-}- a sin ^ 


0 < a < 1. 


( 22 - 6 ) 


On making substitutions in (22-6) from (22-2) and (22-3), we get the integral 


/ 


f -^- 

Jc ^^[1 + a{z — z *)/2f) 

2 r dz 

a J <7 ** + (2i/a)z — 1 


where C is the circular path 1*| « 1. 

Since the roots of 4- {2ifa)z — 1 « 0 are 


Zi 


■ a - Vi - «*), 


-i(l -f Vl - a*), 


(22-7) 


( 22 - 8 ) 


2 r dz 
aJc(z- *t>(* - zt)' 


we can write (22-7) as 


/ 


(22-9) 
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But it Is dear from (22-S) that to 0 < cr < 1 ws hsm ]«i| < 1 am} |s)| > so that 
only one pole » • «i of the integrand 

JIU) m - 1 - 

lies within the unit circle. The residue of R(z) at s «• *i, by (12-6), is 

1 


r lim B(z)(z — Si) - 

#1 *1 - «* 


which, on noting (22-6), yields 


2iVl-c^ 


By the residue theorem, the value of (22-9), which is the same as that of the integral 

(22^). to 2 2^ 

a \/i - a* 

The reader can verify by the same method, or by setting ^ ^ — v/2, that 

de ^ d$ ^ 2x 

Jq l+acosS Jo l+asind — 0 ^* 


0 <a<h (22-10) 


The infinite integral 

/ « fix) 

( 22 - 11 ) 

-00 g(x) 

in which f(x) and g(x) are pol 5 nttomials in x, can also be evaluated by 
calculating the residues. It should be noted that the integral (22rll) 
converges if, and only if,‘ g(z) » 0 
has no real roots and the degree 
of g{x) is at least two greater than 
that of/(x). 

Now, consider the complex ra¬ 
tional function 

R{z) « ^ (22-12) 

which, obviously, assumes along the 
real axis the same values as the 
integrand in (22-11). By hypothesis g{z) *= 0 has no real roots; hence no 
poles of R{z) lie on the real axis. We form the integral 

/ B{z)dz^ I ~dz 
JC Jcg(z) 

where the path C is the boundary of the semicircular region in the upper 
half of the z plane shown in Fig. 31. Since all roots of g{z) lie at a finite 

^This follows directly from the usual tests on convergence of inu^roper integrals. 
See Chap. 2, Sec. 8. 




602 


CQMPI^EX; VARIABLE 


(chap. 7 

distance from the origin, we can take the radius R of the semicircle Cu 
so great that all poles of R{z) = f{z)/g(z), in the upper half of the z plane, 
lie within the semicircle. If the sum of the residues at these poles is Zr, 
the residue theorem yields 


r JKZ) /•« , f f{z) . ^ _ 

/ — / -dx + / - dz == 2iriZr. 

•'o a(z) a(x) ■'O© a(z) 


(22-13) 


• g(z) J-R g(x) ' JCk g{z) 

We show next that when the degree of g(z) is at least 2 greater than 
that of /(z), the integral dz 0 as Z? a, so that formula 

(22-13) then yields 


g(z) 

gi^) 


/ 

•'—00 


dx = 27rf2r. 


For proof, set z = Re^^ in R{z) = f{z)/g{z), and note that 


m 


g(z) 


M 

W 


M const, 


when R is sufficiently large. Hence, by (5-8) 


1 

<f 

M 

i g{z) 1 

-•'Cjej 

¥ 


ilf Mir 


(22-14) 


from which it follows that the integral over Or tends to zero as R —^ 
Thus under the stated restrictions on f(z) and g{z) ensuring the convergence 
of (22-11), formula (22-14) is true. 

An improi>er integral like (22-11) should be understood in th(‘ sense 


/ Kz f(r) 

(22-15; 

-Ri {/(-r) 


where Ri and Rt approach infinity m any manner. However, the method of ealnulation 
indicated in the text actually gives 


lim 

R w 


L 


R g(^) 


dx 


(22-1 r>a) 


so that «* /?2 in (22-15). The expression (22-3 5a) is termed the Cauchy pruicipal 
value of (22-15). If (22-15) exists (as in the case considered in the text) then obviously 
(22-15a) exists and has the same value. But (22-15a) may exist when (22-15) does not; 
for example, take f{x) »» x, g{x) « 1 + 

Example 2. To illustrate the use of formula (22-14) consider an elementary integral, 


L 


dx 

1 + ar* 


B(s) 


1 1 
1 ** (a: + i)(2t - 2)' 


Her© 
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so that the only singularity of R(x) in the upper half plane Is th^ mmple pole at s i. 
Since the residue of B(z) at s >» i is l/2t, formula (22*14) yields 

1 -f X* 2t 

The essential considerations that have led us to formula (22-14) are: 

1. The integral over the semicircular boundary C/^ in (22-13) approaches 

zero as 00 * 

2. The singularities of the integrand in the upper half of the z plane are 
isolated and are at a finite distance from the origin. 

3. There are no singular points on the real axis. 

Clearly, the same procedure can be used to evaluate integrals of the form 


r F(x) dx 

J —00 

by computing F{z) dz as long as the integrand F{z) satisfies conditions 

1, 2, and 3, Occasionally, a slight modification of the procedure 
outlined above can be used when |F(2)| is not sufficiently small in the 
upper half of the z plane, so that the condition 1 is not fulfilled by F(z), 
We illustrate this in the following example. 


Example 3. Evaluate 


r 

J —ec 


COS X 
-f X* 


dx, 


a > 0. 


If we take F(z) «» (cos 2 )/(a^ -f- the method outlined above cannot be applied 
directly, since |cos2| « “f becomes infinite when z —> » along the p axis. 

However, since cos x is the real part of c**, we can write 



cos X 

“f 


dx 





dx 


(22*16) 


where Re stands for the ‘'real part of,” 
Now, if we take 

(22-17) 

then |e«| » * |e~v| < 1 if y > 0. 

Thus, F(z) in (22-17) is bounded in the 
upper half of the z plane, and there is no 

difficulty in showing that / F(z) dz —► 0 
Jcn 

as R —> 00 . Moreover, F{z) m (22-17) 
has only two singular points, which are 
poles at zi « ia and zi «*=» —ia. Only 
one of these, z\ « io, lies in the upper half 
plane. Accordingly, 

I ' 

Jc » + 2“^ 



if C is the boundaxy of the semicircle (Fig. 32) and R is sufficiently large to include the 
point 2 « 2 i. 
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Now the mddue of (23-17) At« » ta is «"*/2at, and since J F(t) <& —* 0 as /J 
we conclude from (22-18) that 


/" ««d* ^ r" e**ax „ . 


j a* -f ** 4 - a;* ' 2ai 

This result is real, and hence the integral in (22-16) is 


a 


L 


idx 


xe 

a 


j a* -f X* 

Inasmuch as the integrand in (22-19) is even, we conclude that 


f; 


'o Or 4* X* 


idx 


ire 


(22-19) 


( 22 - 20 ) 


PROBLEMS 


i 


rtx 

1, Use relations (22-2) and (22-3) to write the integrals i r 

2ir ^ 

• At e yrxrx a\ i i i At ij* *a i 


dB 


H- sin e 

in the form (22-4) and evaluate the resulting integrals by the residue 


'o 6 4* 3 cos 0 
theorem. Check your calculations by formula (22-10). 

d$ 2x 

t. Show that / --^ ^. 0 < o < 1. 

7o (1 4-a cos (1 -o*)’* 

dx X 

8.Showthat 

i. Referring to Prob. 2, show that 


6« Show that: 


t. 


0 (a -j- cos 6)^ (a* — 1)^ ’ 

V2 


if o > 1. 


, . r dx I r dx xy/2 ^ 

^ io 1 + X* 2 i-oo 1 4- x^ 4 

if 

f 


6. Show that 
T, Show that 


coexdx » 

(1 +!>)=" 2‘’’' 


£ 


dx IT (2n — 2)! 

,(1 -h x*)« ” 2'^^[(n - !)!]»' 

if n is a positive integer. Hitii: The residue of (1 4- «*)~^ at * i is 
—n(n 4-1) ... (2n — 2) . 

■t. 


(n - l)!2*'‘-~i 

One way of seeing this is to let f « s - f, so that (1 4* • (it)~^"^(l — J^)” 

The ooc^dent of i/t is easily found by use of the bmosnial theorem. 
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. , la (hiorie des probabilitis n^est que le bon sens confirms 
par le calcuV* — Laplace. 


There is no branch of mathematics that is more intimately connected 
with everyday experiences than the theory of probability. Recent de¬ 
velopments in mathematical physics, moreover, have emphasized the 
importance of this theory in every branch of science. A knowledge of 
probability is required in such diverse fields as quantum mechanics, 
kinetic theory, the design of experiments, and the interpretation of data. 
A recently developed branch of mathematics known as operations analysis 
applies probability methods to questions in traffic control, allocation of 
equipment, and the theory of strategy. Cybernetics, another field of 
recent origin, uses the Uieory to analyze problems in communication and 
control. In this chapter on probability the reader is introduced to some of 
the ideas that make the subject so useful. 


FUNDAMENTALS OF PROBABILITY THEORY 

1. A Definition of Probability. The idea of chance enters into everyday 
conversation: *Tt will probably rain tomorrow,^' “There may be a letter 
for me at the office,^’ “I probably won't get double six on the next throw." 
It is often possible to assign a numerical measure to the notion of proba¬ 
bility which these statements illustrate. Such a measure, however, must 
take account of the speaker's state of knowledge. For instance, in the 
second statement the mailman may know that a letter is there, since he 
put it there himself. His measure of probability and mine are therefore 
not the same. Probability for me is based on my knowledge, and proba¬ 
bility for him is based on his. 

From this viewpoint (which is one of several possible viewpoints) 
probability is a measure of ignorance. In simple oases the state of ignorance 

609 
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can be accounted for, and probability can be defined as follows: Wo a^ree 
to regard two events as equally likely if our ignorance is such that we have 
no reason to expect one rather than the other. For example a 4 or a 0 
is equally likely whem a true die is tossed; heads and tails are ecpially 
likely in a toss of a symmetric coin; acc of hearts and a(‘e of spades are 
equally likely to he drawn from a shuffled deck. 

In the latter example how shall we measure the probability that the 
card drawn will in fact be the ace of hearts? We say that there is “one 
chance out of 52“ and define the probaVjility, accordingly, to be ^ 52 - ff 
it is required only that the card be an ace, common sense suggests that 
the probability should be four times as great, for there are four a<'es, 
equally likely, and only one ace of hearts. Now, the value indeed, 

the probability that a card drawn at random is an ace. Keasoning in this 
way, we are led to tlic following definition: 

Definition. Suppose there arc n mutualhj exclusive^ exhaustive^ and 
equally likely cases. If rn of these are favorable to an event A , then the proba¬ 
bility of A is m/n. 

The term mutually exclusive means that two cases cannot both happen 
at once; the term exhaustive means that all possible cases are enumerated 
in the n cases. There is seldom difficulty in seeing that these conditions 
are satisfied, but careful analysis is sometimes needed to make sure that 
the cases are equally likely. For example, let two coins be tossed, and 
consider the probability that they both show heads. We might reason 
that the total number of cases is three, namely, two heads, a head and a 
tail, or two tails. Since only one case is favorable, the probability is 
Now, this reasoning is incorrect. It is true that there are three cases, but 
these cases are not equally likely. The case of a head and a tail is twice 
as likely as the others, since it can be realized with a head on tlic first 
coin or with a head on the second coin The reader can verify that there 
are four equally likely cases and that Ihe required probability is 

If an event is certain to happen, then its probability is 1, since all cases 
are favorable. On the other hand if an event is certain not to happen its 
probability is zero, since no case is favorable. By means of the definition 
the reader may also verify the important equation 

g = 1 - p, 

where p is the probability that an event happens and q the probability 
that it fails to happen. 

Since one must begin somewhere, it is impossible to define everything, and every 
mathematical theory contains some undefined terms. These terms should be so simple 
that they are easily understood and also so simple that they are not readily defined in 
terms of anything simpler. The notion “equally likely'’ is an example of such a term; 
it was ejq)lAined and illustrated in the foregoing discussion but not defined. 
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Example h If a pair of dice is thrown what is the probability that a total of 8 shows? 
The first die can fall in 6 ways, and for each of these the second can also fall in 6 ways. 
The total number of ways is 


6-f6-f6-f64-6-h6«6-6-36 

and these are equally likely in this problem. A sum of 8 can be obtained in 6 ways, 
namely, as 

2 4-6,6 4* 2,3 + 5, 6-f 3, 4 4-4 
and hence the desired probability is 

This computation of the total number of cases illustrates an important principle of 
combinatory analysis: If one iking can be done in n different ibays aiui another thing can 
be done in m different ways^ then both things can be done together or in succession in mn 
different ways. 

Example 2. In a well-shuffled deck what is the probability that the top 4 cards are, 
resrHJctively, ace, two, three, and four of hearts? 

To find the number of equally likely cases we consider the various possibilities for the 
top 4 cards. The first card may be any one of 52; for each determination of that card 
there remain 51 possibilities for the next; and so on. Repeated use of the principle 
mentioned at the end of the last example gives 

62-51-50-49 


for the total number of cases. Since only one case is favorable, the desired probability 
is the reciprocal of this. 

Wlien r things are dealt into r numbered spaces from a stack of n distinct things, then 
any particular arrangement of the objects is called ^‘a permutation of n things r at a 
time.” If the total numl)er of such permutations be denoted by nJpr, the foregoing 
reasoning yields the important formula 

nPr « n(n - l)(n — 2) ... (n - r + 1). 

Example 3. If a hand of 4 cards is dealt from a shuffled deck what is the probability 
that the hand consists of ace, two, three, and four of hearts? 

The difference between this example and the preceding is that now the order is not 
relevant. Let C denote the number of distinct 4-card hands, not counting order. Then 
the number of distinct 4-card hands when the order is counted is 

C.4P4, 

since each hand of 4 cards admits 4 P 4 different orderings of its members. On the other 
hand the number of distinct 4-card hands when order is counted is also equal to 62^4 
by Example 2. We have, therefore, 

C-iPi ■» 6Jp4, 

so that 

Mp 4 ^ 52-51-50 49 ^ 62! 

^ 4 P 4 "" 4-3-21 “ 41481* 


The desired probability is the reciprocal. 

When r things are taken from a stack of n things, the groupa so obtained are called 
“combinations of n things r at a time.” If the number of such combinations is denoted 
by nOr, the above reasoning gives the important formula 

^ nPt nl 
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Itt tto formula the arrangement of members in a group is not considered. As in the case 
of poker hands, two groups are counted as distinct only if they have different composi¬ 
tions. 

Example 4. What is the probability of drawing 4 white, 3 black, and 2 red balls from 
an um containing 10 white, 4 black, and 3 red balls? 

We suppose that the balls are not replaced. The number of ways to get 9 balls from 
the 17 is itC#. The number of ways to get 4 white from the 10 white is 10C4. The 3 
black balls can be chosen in ways, and the 2 red ones in $0% ways. The number of 
favorable cases is found by multiplication (cf. Example 1), so that the desired proba¬ 
bility is 

sCt 252 
_ «« 

Example 5. If a number x is choscm at random on the interval 0 < a: < 1, what is the 

probability that M < a: < 

....^.^.■-,) Wc imagine the unit interval divided 

into 7 segments each of length H (Eig. 1). 

^ Since the jx)int may be in any one of 

these there are 7 cases, and the phrase 
''at random'’ ensures that these cases are equally likely. Since only 2 cases are favor¬ 
able, the desired probability is 


PROBLEMS 

1 . What is the probability that the sum of 7 appears in a single throw with two dice? 
What is the probability of the sum of 11? Show that 7 is the most probable throw. 

2. An um contains 20 balls: 10 whit^e, 7 black, and 3 red. What is the probability 
that a ball drawn at random is red? White? Black? If 2 balls are drawn, what is the 
probability that both are white? If 10 balls are drawn, what is the probability that 5 
are wliite, 2 black, and 3 red? 

3. "If 3 coins are tossed, some pair is sure to come down alike. The chance that the 
third coin fell the same way as that pair is and hence the j)robability that all 3 fall 
alike is H'' What (if any tiling) is wrong with this argument? What is the proba¬ 
bility that 3 coins will fall alike? 

4. What is the probability that a 5-card hand at poker consists of 4 kings and an odd 
card? 5 spades? A sequence in the same suit, such as 2, 3, 4, 6, 6 of hearts? 

6, In how many ways can you scat 8 persons at a table? Arrange 8 children in a ring 
to dance around a Maypole? Make a bracelet of 8 different beads on a loop of string? 

6. The seats in a concert hall are arranged in an m by n rectangle, the side m being 
parallel to the stage. What is the chance that a ticket bought at random will be for a 
seat in back? On the side? Somewhere on the outside rows of the rectangle? 

7. Two dice are tossed, (a) What is the probability that the first die shows 2? (5) Sup¬ 
pose you are given the additional information that the total shown by both dice is 9. 
What is now the probability that the first die shows 2? (c) If no information is given, 
what is the probability that the total shown is 3? {d) If it is known that the first die 
gave 2, what is now the probability that the total is 3? (Assume that the various 
numbers on the second die are equally likely no matter what is known about the first die.) 

% Sample Space. The equally likely cases associated with the definition 
of probability represent the possible outcomes of an experiment. For 
instance, the 36 equally likely cases associated with a pair of dice are the 
36 ways the dice may fall. Similarly if 3 coins are tossed, there are 8 
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equally Hkely cases corresponding to the 8 possible outcomes of that 
experiment. The set of all possible outcomes is called a mmpU Bpace; 
the ‘^points^’ of the sample space are events. This notion of sample space 
is meaningful even when the events are not equally likely and even if 
there are infinitely many possible outcomes. For technical reasons, how¬ 
ever, the events composing the sample space are required to be mutually 
exclusive. In tossing a die the events '‘an even number shows^' and “6 
shows*^ are not suitable for one and the same sample space. 

A finite sample space is one which has only a finite number of points. 
In such a space let the points (that is, events) have respective probabilities 

Ply P2y • • •} Pn 

with Pi + P2 H-h Pn * 1. 

Suppose the first m sample points, and only those, are favorable to another 
event A. Then we define the probability of A to be 

p(A) « Pi + p2 H-h Pm (2-1) 

(and similarly if some otlier set of sample points is in question). Thus, 
the points of the sample space are weighted according to their probabilities. 

The reader should observe that this definition is consistent with that of 
the foregoing section: If each point of the sample space has the same 
probability 1/n, the result (2-1) becomes 

11 1 m 

p{A) 

n n n n 

Sample spaces with constant probability are called uniform. 

For an example of a nonuniform sample space, consider the following 
experiment: Four coins are tossed, and we are interested in the number of 
heads. An appropriate sample space is composed of the events 

no heads, one head, two heads, three heads, four heads 
with respective probabilities, or weights, 

Ke, Ke, H^y H^y Ke* 

These values are found by counting cases, as follows. The 4 coins can fall in 2^, or 
16, ways. They give no heads in only one case, namely, when they all fall tails, and 
hence the required probability Is He• To obtain 1 head there are 4 cases: heads on the 
first coin or on the second coin, and so on. This gives He- Tor 2 heads, the 2 coins 
giving heads can be any 2 of the 4 coins. Since there are 4 C 2 “ 6 ways to choose 2 coins 
out of 4, there are 6 cases favorable to the event, tv>o heads. The probability, then, 
IB H^. The other entries are found in the same way, or by symmetry. 

To illustrate the use of this sample space, let us find the probability 
of getting at least two heads. Since the last three points of the sample 
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space, and only those, are favorable to this event, the required probability 
is 

Ks + ^6 + He 

Again, the probability that there is an odd number of heads is 


He + He 

since that event corresponds to the second and fourth point. On the other 
hand, this sample space does not give the probability that the third coin 
will fall heads, although the underlying uniform space tells us that tlie 
probability is H- 

Additional information concerning the experimental situation is apt to 
change the sample space. For example, if a toss of a die is known to have 
given an even number, the probabilities of 1, 3, and 5 are changed from 
H to 0. This question is discussed in Examples 2 and 3. 

When two sample spaces are constructed for a given experiment by the procedure of 
the text, it can be shown that they are consistent; that is, they give the same probability 
for any event to which they both apply. This fact is illustrated in the problems, though 
we do not give a formal proof. 

The notion of sample space enables us to define probability even when there is no 
underlying set of equally likely cases. Suppose we are given n events and a corre¬ 
sponding set of nonnegative numbers px such that pi + p 2 H-h pn “ 1. The events 

are said to form a sample space, the numbers p» are called probabilities, and the proba¬ 
bility of various associated events is defined by addition, as in the text. This abstract 
idea can be extended to sets of very general type, the role of the numbers p, being taken 
by a so-called measure on the sot. With such an approach probability theory is included 
in a branch of mathematics known as the theory of measure. ^ A sample space defined 
with the help of arbitrary numl>ers p, is considered in Example 1. 

Example 1. A loaded die has probabilities 


Pi, PS, P8, P4, P6, P6 

of giving the respective values 

1, 2, 3, 4, 5, 6. 

What is the meaning of the condition pi -h P 2 H-h pe * 1? If this condition is satis¬ 

fied, find the probability that a single toss will give either a 4 or a 6. 

The condition means that one of the stated alternatives will certainly happen; for 
instance, the die does not land on edge. From a more abstract viewpoint, the condition 
means simply that the given events and probabilities form a sample space. When that 
is the case, the probability of getting 4 or 6 is P 4 + p« by definition. 

The assumption that “the probabilities are p,” is an example of a statistical hypothesis. 
It is an important task of statistical theory to test the validity of such hypotheses by 
examining the consequences. 

The reader should notice that the values pi were not given, and could hardly be 
given, by considering “equally likely cases."' They may be estimated, however, by 
is^peatedly tossing the die. When pi is the probability of the ace, it can be shown that 
the proportion of aces actuafiy observed, in a large number of tosses, is likely to be close 

^ See Appendix C. 
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to Pi, If tliere &re n toseeiB, and if m aces are observed, this proportion m/n is cltlled 
the relative frequency. The connection between probability and relative frequency is 
discussed in Secs. 8 to 10. 

Example 2. Two coins are tossed. Suppose a reliable witness tells us ‘"at least 1 coin 
showed heads.” What effect does this have on the uniform sample space? 

The uniform sample space had the following appearance before we received the extra 
information: 


]Rv«nt. 

HH 

TH 

HT 

TT 


Probability .... 

H 

H 




The new information assures us that the last event is ruled out but gives no indication 
concerriing which of the other three may have occurred. Since these three events were 
equally likely to begin with, they are considered to be equally likely in the new situa¬ 
tion. (That is not a theorem, but an axiom of probability theory.) The new sample 
space, therefore, is 


Event. 

HH 

TH 

HT 

TT 


Probability . .. 

H 


H i 

0 


Example 3. The tossing of 2 coins can be described by the following sample space: 


Event. 

no heads 

one head 

two heads 


Probability. 


H 



What happens to this sample space if we know that at least 1 coin showed heads but 
have no other special information? 

The first event is ruled out, but we are not told which of the remaining ones occurred. 
It is an axiom of probability theory that the relative probabilities of the remaining 
events remain unchanged in a situation such as this. Since the event “1 head” is twice 
as likely as “2 heads” in the original space, the same is assumed in the new one. The 
new sample space is therefore 


Event. 

no heads 

one head 

i 

two heads 


Probability. 

0 

% \ 



(remember that the probabilities must add up to 1). The reader should check that this 
result is consistent with that of Example 2. 

If the events Ei, E%, ,,,, Ek of the sample space are the ones favorable to A and have 
probabilities pi, pj, ..., p*, the information that A happened gives a new sample space 
with events Ei^ E^, ,, Ek only. The probabilities on that new sample space are 

cpi, cp2, .. cpk, 

where c is a constant so chosen that the sum is 1: 

1 

c --* 

Pi -f Pj -f • “ + P* 

This is the general assertion which is illustrated in Examples 2 and 3. 
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PKOBXJBMS 

1. A coin is tonsed 3 times. Construct a uniform sample space for this experiment. 
OThat is, make a table showing the 8 possible outcomes HHH, HHT, ... and their re¬ 
spective probabilities H, •«• •) According to your sample space what is the proba¬ 
bility of at least one H? At most one H? A run of exactly two H’s in succession? A run 
of at least two H's in succession? H appearing before T? H appearing for the first 
time in the second toss? The sequence THT? The sequence TTT? 

2. In Prob. 1 suppose we are concerned only with the number of H’s. Construct an 
appropriate sample apache. (That is, make a table showing the 4 possible outcomes: no 
H’s, one H, ... with their respective probabilities.) Decide which questions in Prob. 1 
can be answered on the basis of this sample space, answer them, and verify the agree¬ 
ment with your answers to Prob. 1. 

3. The following argument is attributed to I^eibniz: '‘A total of 12 with 2 dice is just 
as likely as a total of U. For, 12 can materialize in just one way, namely, by getting 
6 on one die and 6 on the other; and 11 can also materialize in just one vray, namely, by 
getting 6 on one die and 5 on the other.” Using the notion of sample space explain 
what is wrong with Leibniz' conclusion. (With the uniform sample space, 11 can 
materialize in 2 ways. On the other hand if we choose a sample space in which the 
event on one die and 5 on the other” is a single point, the weight of this point is 
different from that of the point ”6 on one die and 6 on the other.” The student should 
verify these remarks in detail) 

4. The following is due to d’Alembert: ”lf we want to get at least one head with 2 
tosses of a coin, heads on the first toss makes the second toss unnecessary. So there are 
3 cases, H, TH, and TT, of which 2 are favorable to heads. Hence the probability of 
heads is 3^.” Discuas, with reference to the uniform sample space and also with refer¬ 
ence to the sample space which has only the three points H, TH, TT. (Ambiguities 
such as this and the preceding can cause serious errors in practice if the notion of sample 
space is not well understood. In fact, one of the reasons for defining the sample space 
is to avoid this kind of difficulty.) 

6* What happens to the uniform space associated with a pair of dice, if we are told 
that the total shown is 7? 

6. Four coins are tossed. A reliable witness tells us that there are at least as many 
heads as tails. What is the most probable number of heads, and what is its probability? 
StLggesiion' Use the samjile space given in the text. 

7. A coin is tossed 3 times. If we know that a sequence of 2 tails in a row did not 
occur, what is the probability that a sequence of 3 heads in a row did occur? Suggestion: 
Use the uniform sample space, 

3. The Theorems of Total and Compound Probability. Statements about 
probability are often given an abbreviated notation. If A and B are 
events, AB means the event and that is, AB happens only when 
both A and B happen. For example, if two cards are drawn in succession 
without replacing, suppose A is the event *'the first draw gives a king’' 
and B is the event second draw gives an ace.” Then AB happens 
if we get a king on the first draw followed by ace on the second. 

^ It is customary to write piA) for ‘^the probability of the event A.” 
Ih the foregoing example p(A) « since there are 4 kings among the 
52 cards. If nothing is known about the results of the first draw, then 
p{B) « Ht 
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To see this, note that the total number of cases is 52*51, since there are 62 ways to 
get the first card and, when that card is chosen, 51 wasrs remain to get the second. To 
count the cases favorable to B, observe that the ace obtained on the second draw may 
any one of the 4 aces. For each choice of this ace there remain 51 possibilities for 
the first card. The number of favorable eases, then, is 4-51, and hence 


p(B) 


4 51 4 

52 51 “ 52 * 


(W) 


Sometimes two events A and B are so related that the information that 
A happened changes the probability of B. To deal wiili this situation it 
is customary to write Pa{B) for *The probability of given A” In the 
example cited previously, 

Pa(B) = Hi (3-2) 

(for if A happened, the first draw gave a king, and hence the 4 aces are 
to be found among the remaining 51 card.s). On the other hand when A 
is the event “tlio first draw gives an ace” and B, as before, is the event 
“file second draw gives an ace,” then 'Pa{B) (since now only 3 

aces remain wlum A hajipens). Both values for Pa{B) are diffiTcnt from 
p[B), the probability of ace on the second draw when nothing is said about 
the first draw. 

In this riotatioii the theorem of compound prohability takes the following 
form: 

'J'heoukm // A and B arc any events^ then 

p(AH) - p(A)pa{B), (3-3) 

Informally, *hhe jirolialiility that A and B happen is the probability 
that A hap[)ens times the ])r{)babiUty that 7? then happens.” A proof is 
easily given by eonsid(‘nng eiiually likely eases. Let and Uab denote 

the numbers of cavses favorable to J, /?, and AB, respectively. Then 


p{AB) = 


rtab 


na n^b 
n na 


Now, na/n is p{A) by definition. After A havS happened, the only possible 
cas<\s are the ria cases favorable to A. Of these, tliere are nab cases favor¬ 
able to B. Since the Ua cases are to be considered ecjually likely, the 
(juolient riaf/na represents the probability of B when it is known that 
.4 happened, and this gives (3-3). 

To illustrate tlie theorem (3-3), let us find the probaliility of drawing 
2 a('es in succession from a pa(‘k of 52 cards. The probability of ace on 
the first trial is 4^2* After the first a(‘e has lieen drawn, the probability 
of drawing another ace from the remaining 51 cards is so that the 
probability of two aces is 


%2'Hi M21- 
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This assumes that the first card is not replaced. When it is replaced, the 
reader will find that the desired probability is 


H2^f42 =* K69- 

For another illustration of the theorem (3-3), lot us find the probability 
of drawing a white and a black ball in succession from an urn containing 
30 black balls and 20 white balls. Here the probability of drawing a white 
ball is After a wdiite ball is drawn, the probability of drawing a 

black ball is Hence the probability of drawing a white ball and a 

black ball in the order stated is 


The events A and B are said to be independent if the information that 
A happened does not influence the probability of B, Hence for such 
events PAiB) = p{B)^ and the theorem of compound probability takes the 

form 

p{AB) ^ p{A)p{B)^ for independent events. (3-4) 


For instance, let a coin and a die be tossed, and let A be the event ‘^head 
shows’^ while B is the event “4 shows.” These events arc independent, 
and hence the probability that heads and 4 both appear is 

p{AB) = v{A)p{B) - mm - ^2. 

The result (3-4) is readily extended to any number of independent events 
A, jB, C, .... 

Besides the theorem of compound probability, there is a second funda¬ 
mental relationship, known as the 
throrem of total probability. If A 
and B are two events, A + B is 
defined to be the event or B 
or both.” For instance, let A be 
the event number greater than 
3 shows” while B is tlie event ”an 
even number shows” in a toss of 
a die. Then A -f ^ happens if the die gives 2, 4, 5, or 6. In this 
notation the theorem of total probability reads as follows: 

Theorem. When A and B are any events, then 

p{A -h B) = p(A) + p{B) - p{AB). (3-5) 

We can represent the statement (3-5) diagrammatically by the intersecting 
point sets A and B shown in Fig, 2. 

Referring to the definition of probability by equally likely cases, suppose 
the numbers of cases favorable to A, JB, AjB, and A + B are denoted by 
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respectively. To find the number favorable to ^ it will not do 
simply to add Ua, and nb, for the cases favorable to both A and B are counted 
twice in tiiis addition. To take account of that we must subtract n^b, 
thus: 

na^h = na + nb - nab* 


Dividing by n, the total number of cases, gives 

Tib n-ob 
n n n n 

which is equivalent to (3-5). 

To illustrale the theorem, let us find the probability that at least one 
die gives 4, when two dice are tossed. The probability that both give 4 
He- The probability that the first gives 4 is and similarly for the 
second. Hence the probability that at least one gives 4 is 

p{A -h -S) = H H ““ He = ^He- (3-6) 

This 1*5 consistent with the lesult givoix by counting cases Specifically, there are 5 
cases with a 4 on th(‘ first and a number other than 4 on the second, there are 5 cases 
With 4 on the second and a number other than 4 on the first, and there is 1 case with 4 
on b(jth The number of favorable cases is therefore 5-j-5“hl «11 so that (S-d) 
fallows 

For mutually exclusive events, that is, for events A, B whieh cannot 
both happen, p{AB) = 0. Hence the theorem of total probabilit}^ takes 
the form 

p{A + B) — p{A) + p(B)y for mutually exclusive events. (3-7) 

The statement (3-7) can be depicted 
by the nonintersecting point sets in 
Fig 3 

For example, in a toss of a die let 
.4 be the event shows'’ while B 3 

is the event ^‘5 shows.” Since these 

events are mutually exclusive, the probability of getting either 4 or 5 is 

piA +B)^ p(A) -b p(B) = K + H - 

A result similar to (3-7) applies to any number of mutually exclusive 
events A, B, C, .... 

The for(‘going analysis, by counting cases, establishes the theorems of 
total and compound probability for uniform sample spaces only. Actually 
the results are valid for arbitrary sample spaces, as will be indicated next. 
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Assuming that the sample space is finite, let the events Ei of the sample 
space be so numbered that 

Elf ..-B; 

are favorable to A alone, 

are favorable to both A and B, while 


Ek-^lf . . Etn 

are favorable to B alone. If the associated probabilities are pi, then (3-5) 
is equivalent to the identity 


Pi H-h Pm =» (Pi H-h Pi + py-fi H-h Pfc) 

+ (Pj+i d- h Pk + Pk+i d-h Pm) — (P;-Ki d-h pit). 

The three parentheses on the right represent, respectively, p(A), p{B), 
and p(AB) by definition. 

To derive (3-3) for a general sample space, recall that the sample points 
favorable to B have the same relative weights after A happened as before. 
Hence, in the previous notation, 


p{AB) = pj+i d-h Pa 


(pi d - h Pk) 


Pj+i 


<Pi d- hPk 


d-...+ 


Pk 


Pi^ -h pA> 


«= p{A)pa{B). 

Example 1. The probability that Peter will solve a problem is pi, and the probability 
that Paul will solve it is p 2 . What is the probability that the problem will be solved if 
Peter and Paul work independently? 

The probability that both solve it is Pip 2 , by the theorem of compound probability, 
(3-3). Hence the probability tliat at least one solves it is 


Pi *f P2 ~ PlP2 (3-8) 

by the theorem of total probability, (3-5). 

Example 2, Solve Example 1 by finding the probability that both fail. 

Peter's probability to fail is 1 — pi, and Paul’s probability to fail is 1 — p 2 . The 
probability that both fail is 

(1 - p,)(l - Pit) 

and the probability of the contrary event, that at least one succeeds, is 

1 - (1 Pi)(l - p2). (3-9) 

The consistency of (3-8) and (3-9) is easily verified. 

Example 3, A bag contains 10 white balls and 15 black balls. Two balls are drawn 
in succession. What is the probability that one of them is black and the other is white? 

The mutually exetoive events in this problem are (n) drawing a white ball on the 
first trial and a black ball on the second, (5) drawing a black ball on the first trial and 
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a white on the eecoiid. The probability of (a) is *9^6and that of (6) is 
so that the probability of either (o) or (b) is 

Example 4. How often must a pair of dice be tossed to make it more likely than not 
that double 6 appears at least once? 

The probability that double 6 does not appear on a given toss is *^^6, no matter 
what is known about the preceding tosses. Repeated use of the theorem of compound 
probability gives 

for the probability that double 6 does not appear in any of n tosses. It is desired to 
choose n in such a way that this probability is less than Thus, 

Taking the logarithm gives 

< - log2 


or 


n > 


1ok2 
log 3%5 


24.6. 


Thus 25 tosses suffice, but 24 do not. 

Example 5. Peter and Paul take turns tossing a pair of dice. The first to get a throw 
of 7 wins. If Peter starts the game, how much better arc his chances of winning than 
Paul’s? 

This problem is different from any we have coasidored hitherto, in that there are 
infinitely many possibilities. Namely, Peter may win on his first throw, or on his second 
thiow, or on his third throw, and so forth. To apply tlie preceding ilu'ory, w'e siinpiy 
consider the probability that Peter wins in n throws and take the limit as n —► 

A wide variety of questions involving infinitely many outcomes may be dealt with in a 
similar manner. 

The probability of 7 is and the probability of not getting 7 is Vg. Ifcnce the 
probability that Peter wins on his first throw is }i. The probability that Peter wins 
on his second throw is (®o)^(36) (smee Peter’s first throw and Paul s first throw must be 
other than 7 but Peter s second throw must be 7). Peter's probability of w inning on his 
third throw is and so on 

By the theorem of total probability the probability that PeK*r win.s is 
H + + ihm)' + ’ • — ih)i^ + r -f +. ..), where r » 


111 1 6 
61 - r "" 61 - n' 


(3-10) 


A similar procedure shows that Paul’s chance of winning is or one can reason as 
follows: The probability that 7 does not occur m n trials Ls ^^ince the limit is 

zero, the probability of an eternal game is zero, and PeWr or Paul is sure to win. Thus, 
Paul’s chance is 

1 - Ki » 


PROBLEMS 

to What ifl the probability that 5 cards dealt from a pack of 52 cards are all of the 
same suit? 
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3. Five coins are tossed simultaneously. What is the probability that at least one of 
them ehows a head? All show heads? 

8 . The probability that Paul will be alive 10 years hence is H and that John will be 
alive is . Whaf is the probability that both Paul and John will be dead 10 years hence? 
Paul alive and John dead? John alive and Paul dead? 

4 . One purse contains 3 silver and 7 gold coins; another purse contains 4 silver and 8 
gold coins. A purse is chosen at random, and a coin is drawn from it. What is the 
probability that it is a gold coin? 

6 * Paul and Peter are alternately throwing a pair of dice. The first man to throw a 
doublet is to win. If Paul throws first, what is his chance of winning on his first throw? 
What is the probability that Paul fails and Peter wins on his first tlirow? 

8. How many times must a die be tlirown in order that the probability that the ace 
appear at least once shall be greater than J^? 

7. Tw'enty tickets are numlwed from 1 to 20, and one of them is drawm at random. 
What is the probability that the number is a multiple of 5 or 7? A multiple of 3 or 5? 

Note that in solving the second part of this problem, it is incorrect to reason as follow^s: 
The numljer of tickets bearing numerals that are multiples of 3 is 0, and the number of 
multiples of 5 is 4. Hence the probability that the number drawm is either a multiple 
of 3 or of 5 is ^^0 4* 5 ^^0 W'hy us tins reasoning incorrect? 

8 . A card is chosen at random from each of 5 d(‘cks What is the probability that all 
are face cards? Would the probability be larger or smaller if all 5 cards were taken from 
one deck, without replacing? 

9. Answer the two questions in Prob. 8 if the desired hand is 1, 2, 3, 4, 5 of clubs; if 
the desired hand is to have at least 2 aces but is otherwise imrestncted. 

10 . Each of two radio tubes has probability p of burning out during the first 100 hr 
use. If both are put into service at the same tune, what is the probability that at least 
one of them is still good after 100 hr? Generalize to n tubes. If p « 0.1, how many 
tubes are needed to give a probability > 0.99 that at least one is good after 100 hr? 

4 . Random Variables and Expectation.’ A process is random if it is 
impossible to predict the final state from the initial state (as, for example, 
in a toss of a coin or a die). Associated with a random process there may 
be certain numerically valued variables which themselves have a random 
character. For instance, if X denotes the number obtained by tossing a 
die, then A" is a variable which assumes the values 

1, 2, 3, 4, 5, 6 

corresponding to the six events: 1 shows, 2 shows, and so forth. The re¬ 
spective probabilities are 

K, K, K, K. 

Again if A is the number of heads obtained when 3 coins are tossed, then 
A is a variable which assumes the values 

0, 1, 1, 1, 2, 2, 2, 3 (4«1) 

^ Sections 4 through 6 may be omitted on the first reading without loss of continuity, 
but they are essential to the developments in Sec. 13. 
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corresponding to the various ways the coins may fall. For instance, 
X «= 2 corresponds to each of the three events: HHT, HTH, THH. 

Similarly, if a gambler stakes d dollars on a game, the amount he wins 
assumes the values 

d, —d 

in correspondence with the events “he wins the game” and “he loses the 
game.” If his probability of wanning is p, the respective probabilities of 
X d and X = —d are 

p, 1 ~ p. 

These special cases illustrate the important idea of raitdom variable, A 
random variable is a numerical'^valiicd function defined on a sample space 
In symbols, 

X(^) = T,, i=l, 2, (4-2) 

w’^here e^ are the events of the sample space and x^ are the values of the 
random variable A". 

Let |e,j be a sample space of n events c, with associated probabilities 
pt. Let be a random variable defined on jf,j and assuming the value 
Xi at the fth sample point, so that (4-2) holds. The expectation or expected 
value E{X) i.s then defined by 

E{X) = PiTi + P 2 X 2 d-h PnXn^ (4-6) 

For example, if X is the numlxu* obtained in a toss of a die, then X 
assumes the valut*8 1, 2, 3, ... with corresponding probabilities p^ = 3^. 
I lence 

E{X) = K'l + K 2-1~34'3 + K'4 + J^-5 + 3^*6==J^. 

Similarly, if A" is the number of heads obtained when 3 coins are tossed, 
then (4-1) and (4-3) give 

E{X) - + H +H+ys-^ y + H + H + H 

wLen W’e note that p, — ^ « in this cavse. 

By grouping terms we can write the above sum in the form 

E{X) = VsO + ?8-l + 5^8-2 + 3^-3. 

The factors 

0, 1, 2, 3 

represent the numerically distinct values of X, and the factors 

Hj H 

represent tlie probabilities corresponding to these distinct values. For 
example, % is the probability of 2 heads when 3 coins are tossed, and 
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hence i« <lie probability that X » 2. A similar grouping of terms can 
be applied to the general definition (4-3) and yields the following useful 
theorem: 

TmeoniOM I. The expectation E(X) is given by 

E(X) = PiXi + P 2 X 2 + • • • + PfXr 

where X 2 t ..air are the numerically distinct values of X and where Pi 
ie &ie probaMlity that X = Xi. 

Let Xi be the r distinct values of a random variable X, and let yj be the 
9 distinct values of another random variable Y. The sum X -f is a 
random variable which is defined to be x, + when X = Xi and Y « yj. 
Thus, X + F is defined on a sample space whose points consist of the r$ 
events 

X - Xi and F - y^ (4-4) 

for t « 1, 2, ..r and j = 1, 2, One of the most important theo¬ 

rems in probability theory concerns sums of variables and reads as follows: 

Theorem IL The expectation of the sum of two random variables is equal 
to the sum of the expectations^ or in symbols, 

E{X + F) - EiX) -h E(Y), (4-5) 

To prove Eq. (4-5) let p,j be the probability that simultaneously X « x,- 
and F = yy. Thus, p*/ is the probability of the event (4-4). The definition 
of expectation yields 

EiX -f F) + yj)y (4-6) 

since x* + yj is the value of X + F which corresponds to the event (4-4). 
By rearrangement, 

E{X + F) = z *.• (E p.,) + z y, (E Pii) ■ (4-7) 

Now, Zp<: j represents the probability of 

(X « Xi, Y « yi) or (X = x,-, F « ^ 2 ) •.. or 

(X » x,‘, F - ysY 


Hence, it represents ^ the probability P, that X »= x,*. Theorem I now gives 

Z*. (Zp.y) “ Z^.P. = TO 


and similarly. 


Z!/>(Zp.,) “.EdO. 


*This rfiowB that XXpa >• XPi » 1, hence that the events (4-4) actually do formi 
•ample space. 
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Thus, (4-7) is equivalent to (4-6). The extension to any number of vari¬ 
ables is immediate. 


The following alternative approach to Theorem II does not require the use of Theo- 
rera I, X be defined on a Rample space {a»} containing n events and F on a space 
I containing w ©vents. Thus, 

X{(h) « Xx and F(6y) *• yj. 

The variable X d" F is defined on a sample Rpace whose mn ©vents etj hapjwn when, and 
only when, Oi and bj both happen. The value of X + F corresponding to tlie event e,y 
IS defined to be Xi + V}^ if Po i*^ Ihe piobability of then the definition of expecta¬ 
tion gives (4-(>), which may be w'ritten in Ihe form (4-7) as before. Since the event/S of 

the sample space {?>il are mutually exclusive (Sec. 2), the sum 23 Pu represents ih© 

J 

probability of 


a, and bi or and ... or a,- and 6»». 

Hence it represonts p*, the probability of a,. The first term in (4-7) is therefore E{X) 
by (4-3), and similarly, the second term is EiY). 

The sums 

l.Po and 2^ptj (4-8) 

j * 


are called the marginal probahihltes of a, and 6^, respectively. In modern statistic^al 
tlieory it is customary to start with th(‘ larger sample space jct;{ and to define the 
probabiliticvS on the smaller spaci'b lu,l and !?>;{ by means of (4-8). Theorem II is then 
valid, so to say, by fiat. 


Since Xpi = 1, 
center of mass: 


the expectation E{X) in (4-3) may be interpreted as the 


E{X) = 


Pl,Ti + P2^2 d-h PnXn 


Pi + P2 H-h Pn 


For equally likely Xi the result reduces to the arithmetic mean 

1 1 

EiX) ^ ~ (a:i + X 2 H-f- a:„), if each p» = -• 

n n 

Thus, E{X) is a measure of the location of X; it is a typical value. The 
following sections show that if sufficiently many observations of the 
variable X are made, the mean of those observations will almost certainly 
be close to E{X). In this sense, E{X) represents the average value at¬ 
tained by X in the long run. 

Throughout this section random variables were denoted by capital letters to avoid 
confusion between the variable and its values Xt or yj. In statistical literature the varia¬ 
bles are usually denoted by small letters. Since the distinction has now been sufficiently 
emphasized, we shall often use small letters in the remainder of this chapter. Thus, 
depending on the context, x, may be a set of random variables or the values of a single 
variable. 
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Example 1. Find tfae eacpeoted number of heads when n coins are tossed. 

Let X,' «“ 1 if the ith coin shows heads and X* «• 0 otherwise. Then, for each i, 


E(Xi) - ^ 

(The reader is cautioned that Xj, X 2 , ... are distinct variables here, not the different 
values Xi of a single variable.) The number of heads m is 


and hence 


m sa Xi -f X2 H-f- Xn, 

Eim) «X(Xi+X 2 +“--f Xn) 

«X(Xi) + F(X2)+"--hX(Xn) 




n 

2 * 


Example 2. From an urn containing a white and b black balls, a ball is drawn at ran¬ 
dom and set aside. What is the expected number of nliite* balls left in the urn? 

Let X be the number of white balls left. If a white ball is drawn, then X « a — 1, 
whereas if a black ball is drawn, then X ^ a. Hence 


E(X) 


0+6 


(a - 1) -f 


a -\-b 


a 

a b 


Example 3. A deck of cards is thoroughly slmffl<‘d. We say there is a coinctfJcnn’ if 
a card has the same position after shuffling ns it had before (e g , if il is the fourth from 
the top both times). Find the expecU^d number of coincidences. 

I 4 *t X", =*» 1 if the tth c^rd is in the same position before and after shuffling, and let 
X, » 0 otherwise. Then 

E(Xd - J^2-l +"«2-0 « 3^2. 

Since the number of coincidences is XX„ its expectation is 

B{Xi) -f E{Xt) + • • • + E{X^) « 1. 


PROBLEMS 

1. A bent coin has probability p of giving heads and probability g »= 1 — p of giving 
tails. l>t X be a random variable representing the number of heads when the coin is 
tossed three times; X is defined on a sample space consisting of the 8 events HHH, 

HHT, ... with associated probabilities p*, p^g,- (a) Make a table giving the 8 values 

of X associated with the 8 sample points and their resfxjctive probabilities. (6) Make 
a second table giving the 4 distinct values of X and their probabilities, (c) Compute 
the expectation E{X) from your table (a) and also from your table {b). 

2. If X is the numlier of heads and Y the number of tails, find E(XY) from your 
table in Prob. la and also from that in 16. Is it true that E{X Y) =« E{X)E{Yyi Htnt 
Make a table giving the 4 values of XF m the 4 cases of Prob. 16. 

8. Peter turns up the cards one at a time from a 52-card deck, and Paul tries to guess 
what the cards are. Find the expected number of correct guesses (a) when Paul calls 
out at random, perhaps repeating himself, (6) when Paul calls off the 52 cards, naming 
each one just once, (c) when Paul calls out "‘ace of spades” each time. (Assume that 
^ Paul has no actual insight into the behavior of the cards.) 

4. In Prob. 3, suppose Peter tells Paul what the card was immediately after Paul 
guesses. Paul has ihe good sense not to call any of those cards, since he knows they 
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have been set aside* What is the expected number of correct guesses now? HinU Let 
X, ■» 1 if the ith guess is correct, Xi « 0 otherwise. X(X,) » ? The expected number 
of correct guesses will l>e found to txj approximately log# 52. 

6. A coin is tossed repeatedly. What is the expectfnl number of the toss at which 
heads first appear? Hint: Let X be the numl>er of the toss at which heads first appear. 
Then X has the values 1, 2, B, ... with re8j>eetive probability's Y^, 34, .... The 

reader is reminded that * r/(l -* r)^ for any r such that |r | < 1. 

6. Discrete Distributions. When the values of a random variable are 
distinct, the associated probabilities pi may be written in the form 


P. ■= /(a-,). 

Since the .r,' are supposed to be all the possible values of x, we must have 

2/(x.) = 1, (5-1) 

just as in the last section 2pi = 1. Also/(x) > 0, because f{x) is a proba- 
bility. 

hV>r example, let .r be the number of heads obtained when 4 coins are 
tossed. If the value j = 0, 1, 2, 3, or 4 is given, then the probability to 
assume that value is determined by the ta))le 


X « 

0 

! 

1 

2 

3 

4 

fU) = 

^16 

"he 


fl'6 

He 


The function f{x) is called the frequency function for reasons which will 
now be explained. Suppose n observations of the variable .r are made; 
how often should we expect .r == a\? To answer this (question, let Xjt = 1 
ii .r = Xi at th(* kih o})servation and X* = 0 otherwise. The number of 
limes X — Xi is 

m ~ Xi + X 2 + *' * + 

Since the definition of expectation gives 

E{X,) = 1 ./fe) + 0[1 -/(T.)] = /(xO, 

we have the fundamental result 

E{m) = nf(Xi), (5-2) 

Thus, the frequency function f(xi) is proportional to the expected frequemy 
of the event x *= x, in a fixed number of observations. 

Since the values x,- are distinct, the events x = Xi and x ~ X 2 are 
mutually exclusive. Hence, by total probability, the probability of 
X » xi or X = X 2 is 


f{Xl) +/(X2)* 
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In just the same way the probability of a; « xt, orx2, or xj, is 

E/Cx,). (M) 

immi 

It k often desirable to consider the probability that x will not exceed 
a given value. If xi, T 2 , ..^6 the values of Xi which do not exceed 
t, then the probability that x < t k given by the sum (5-3). That is, 
the event is equivalent to the event “x «= Xi, or x = X 2 , ..or 

X « Xjfc.'^ It is customary to write 

^’(0 = E /(X.) (5-1) 

for summation over the values of x* which do not exceed t. The function 
F{t) thus obtained is called the distribution function; it gives tlie probability 
that X <L When t is so small that no satisfies the sum (5-4) 

has no terms, and F{i) = 0 for such i. When t is so large that every x^ 
satisfies x* < then the sum (5-4) includes every x^. In this case (54) 
gives the value F{i) = 1. 

For example, if x is the number of heads obtained when 4 coins are tossed, 
the distribution function is described by the following table: 


t 

« < 0 

0 < t < 1 

1 < t < 2 

2 <t <Z 

Z <t <4 

4 < t 

F(t) 

0 

He 

He 

‘He 

‘•He 

1 


These entries are obtained by adding the values of fix) which were found 
previously. For instance, corresponds to the interval 2 < ^ < 3 
because 

Z Kxi) = /(O) +f{\) +/(2) = fU + + Ke - 'He. 

The value is the probability of getting at most 2 heads when 4 coins 
are tossed. 

The variables x considered so far in this chapter are called discrete 
variables because they assume isolated values only. For instance, the 
number of heads obtained when several coins are tossed is an integer 0, 
1, 2, 3, ... (and cannot fill up an interval). The distribution of such a 
variable is called a discrete distribution; it is defined for all values of x, 
not only for the discrete set of possible values x*. One may also think of 
the frequency function as being defined for all x, taking fix) =0 for 
values X other than the x*. (For example, the probability of getting 3.2 
heads is mro.) The fact is that we may define fix) in any arbitrary fashion 
for values other than the xjk, provided some care is taken in the interpreta¬ 
tion of the results. This possibility is exploited in the following discussion. 
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Graphical representation of the functions f(x) and F(x) is given in 
Pigs. 4 to 6. Figure 4 is valid as a probability for all x. The relationship 
of fix) and F(x) is clarified, how¬ 
ever, if fix) is modified as shown in 
Fig. 5. Here, the value of fix) at 
any integer m is used for/(x) in the 
interval of length 1 centered about 
m. The resulting step function still 
gives the probability that z ~ Xk^ 
provided Xk is an integer. The ad¬ 
vantage of redefining fix) in tins 
fashion rests upon the following 
property, which is easily verified: If t is an integer^ then Fit) u the area 
under the curve of Fig, 5 up to the value x ^ t + }/^. For instance, the 
area up to the value x ~ 2]4, is found to be 

/(O) +/(1) +/(2) - F(2) 
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by adding the areas of the shaded rectangles. When the values Zk are 
equally spaced, similar considerations apply to any distribution and 
frequency functions F and/. For unit spacing, that is, for Xk^i — « 1, 

Eq. (5-1) expresses the fact that the area under the curve is 1. 

Actually, it is possible to describe the relationship of / and F directly, without intro¬ 
duction of the intermediate curve (Fig. 5). The description involves the so-called Stieltjes 
integral, which is now to be defined. I^t F{t) be a nondecreasing function on an interval 
a < t ^ h, and let 4i{t) be continuous. Choose a set of points ii, h, in on the interval 
to,fc], and choose intermediate values 

^ ^ ffc-fl* 

As the subdivision given by the 4s is made finer and finer, in such a way that 

max 14+1 ~ 41 -► 0, 
it can be shown that the expression 

20(.4)IF(4+i) - F(4)1 

tends to a limit (independent of the manner of subdivision and of the points f*). The 
limit is called the Stieltjes integral of <t> with respect to F and is written 

f\{t) dF{l), 

Ja 

When F{t) is a discrete distribution corresponding to Xk and /(x), the function F{t) 
has a jump of value f(xk) at each value Xk but is constant between those values. Hence 
the differences 

F(4+i) F(tk) 

behave much like the function exemplified m Fig. 4. They aasume the value f(xk) if the 
interval (4,4+i) contains a single point Xk, and they assume the value 0 if the interval 
contains no point Zk- The relationship of / and F is now dijscribed by the equation 

F{C) = j‘ dh\x) 

where the integral is a Stieltjes integral. Although we have not defined a diffen'ntial 
dF, we may think of dF{x) as being equivalent to the frequency function f{x) in the 
sense described above. 

Example 1. In terms of the distribution function, express the probability that 
< X <h, where a and b are two numbers with a < h. 

The event “a; < can materialize in the mutually exclusive forms 

or a < X < b. 

Hence, by total probability, 

Pr (x <h) ^ Pr (a: < a) -f Pr (a < a; < 6) 
where Pr means “the probability that.” This yields the desired expression 
Pr (a < X < 6) « F{b) - F(a) 
when we recall that the distribution function F{i) satisfies 

Pr (X ^ 6) « F{h\ Pr (x ^ a) F(a). 


(&-5) 
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Example 2. In terms of the frequency function /(x) the expectation of any variable 
y « ff{^) is 

E{y) * XmlM, Vk « giXk)^ (5-6) 


We consider the variable y to be defined on a sample space whose points are the n 
events 


X » Xi, X *« X2, ,.., X •• Xn- 


The probability of the event “x » x*” is/(xa); the value of y corresponding to the event 
“x « xjk” is j/fc « g{xk). Hence, (5-6) follows from the general definition of expectation. 


PROBLEMS 

1. Suppose a coin is tossed 5 times. What is the probability that this experiment 
will yield 0, 1, 2, 3, 4, 5 heads? 

2. If X IS tlie number of heads in Prob. 1, make a table representing the frequency 
function/(x). Plot/(x) and also the step-function modification (see Figs. 4 and .5) 

3. In I^rob. I make a table and also a graph for the distiibution functimi F{t), 

4. If a coin is tossed 5 tim(‘a, find the probability that the number of hes<ls x satis¬ 
fies 1 < J < 4 by use of (o) tiie frei^uency function/(r) computed in Prob. 2, {b) the 
distribution function F{t) computed in Prob 3, (c) the stop-function grafih obtaineti m 
Prob. 2, with reference to an appropriate area under the curve. 

6. Continuous Distributions. Since measurements are made only to a 
certain number of significant figures, the variables which arise as the result 
of an experiment are discrete. For example, if the diameter of a shaft is 
measured to the nearest 0.01 in,, the measurement is a variable which 
assumes only isolated values, such as 3.21, 3 22, 3.23, ... in. Never!lioless 
it is convenient to introduce continuous variabl6?s, because the}^ are easier * 
to handle analytically. Siudi variables are now to be discussed. 

Let a point be chosen at random on the interval 0 < x < 1. IIow shall 
we measure the probabilities associated with that e\'cnt? If the interval 
(0,1) is divided into a number of subintervals, each of length Ar = 0.1, then 
the point x is equally likely to be in any of these subinter\'als (Fig. 7). 
The probability that 0.5 < j* < 0.8, 

. ^ 1 • r. • 1 -1 . -t . ■'« ... I 

for example, is 0.3, since there are o . 0.5 1 

three favorable cases. The probabil- 7 

ity that 0.52 < x < 0.84 is found to 

be 0.84 — 0.52 * 0.32 when we divide the interval into 100 parts, and so 
on. This reasoning sho\^s that the probability for x to be in a given sub- 
interval of (0,1) is the length of tliat subinterval. If Pr stands for '*the 

^ This remark does not justify the use of continuous variables m applied mathematics. 
The jastification rests upon the fact that discrete variables can be approximated by 
continuous ones within the experimental error. 

* In this section it will not matter w hether the intervals include tlieir end pointa or 
not. Thus, Pr (a < X ^ 5) Pr (a < x < h). 



832 PaOBABIUTT [chap. S 

probability that,” then 

Pr (o < * < 6) = b - o, 0 < o < 6 < 1. (6-1) 

When (6-1) holds, the variable x is said to be uniformly distrAuted on 
the interval 0 < x < 1. Since the expression (C-1) may be written 

Tt {a <x <b) = t dx = C 1 dx, (6-2) 

Ja Ja 

it is customary to speak of the probability density^ which in this case is 
unitj. 

More generally, a variable may be distributed with an arbitrary density 
Six). For such a variable the expression 

Jit) At 

measures, approximately, the probability that a; is on the interval 

t <x <t + At. 

An exact expression for the probability that a: is on a given interval ia,b) 
is' 

Pr (a < a: < 6) - fix) dx. (6-3) 

Ja 

This relation is illustrated in Fig. 8. 

As indicated above, the function 
fix) is called the probability density; 
the function 

Fit) - f fix) dx (6-4) 

is called the distribution function. 
Evidently, Fit) is the probability 
that X is in the interval 
in other words, 

Fit) = Pr (x < 0. (0-5) 

Fig. 8 l^fix) is continuous, then (6-4) gives 

F'it)^fit) 

and one may speak of a probability differential 

dFit) *= fit) dt, 

} The symbol x in (6-3) is used in two different senses. On the left x is a random varia¬ 
ble, and on the right x is the variable of integration. The integral could have been 

written f fiQ for example. 
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To find the distribution function t associated with the uniform density/(a;) on the 
interval (0,1) we take 

fix) ^0, X < 0, 
f(x) - 1. 0 < ® < 1, 

f(x) - 0, a: > 1. 

This expresses the fact that x is sure to be in the interval (0,1), and is uniformly dis» 
tiibuted on that interval. Hence, for 0 < < < 1, 

^’(0 f(x)dx 

- f f{x)dx +lj(x)dx 


In a similar manner one obtains 


«0 + 



t. 


(6-6) 


m « 0 , t < 0 , 

F(t) » 1, t>l, 

uhich expresses the fact that x is never <0 but is always <1. 


The following density functions arise in many applications. 
Poiason: 


e -» 0<x<oo^;i>0, r~ nonnegative integer, 

r! 

Gauss: 

1 

-- ^ \ —00 <T<00, cr>0, —«><;i<00^ 

V 27r (T 


Maxwell-Boltzmann: 


4a 



0 < a; < 00, a > 0. 


The random variable is x; the parameters /x, a, a are constants. For 
example, in the Maxwell-Boltzmann distribution x is tiie magnitude of 
the velocity of a gas molecule and a — mf2kTy where m is the mass, T 
is the temperature, and k is called the Boltzmann constant. A graph of 
the function for a = 1 is given in Fig. 8. The Poisson distribution is 
discussed in Sec. 11; the Gauss distribution in Secs. 9, 10, and 12. The 
latter is often called the normal distribution, but in this text the term 
normal disirilmtim is applied to the case <r » 1, « 0 only. 
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Densities and distribution functions are easily defined for several vari¬ 
ables. We say that /(ar,y) is the probability density (or the joint proba¬ 
bility density) for {x,y) if the probability that {x,y) is in any given region 
R of the xy plane is 

Pr ((T, 2 /) in ft] = 11 ^ f{x,y) dx dy. (6-7) 

The diatribution function is 

dx dy 

J —QO*'—00 

« Pr [x < s and y < 1], (6-8) 

Since probabilities are nonnegative, the density functions in (6-3) and 
(6-7) satisfy 

fix) > 0, fix,y) > 0. (6-9) 

Since the variables always have some finite value, in (6-3) 

1 - Pr ( — 00 < X < oo) = f f(x) dx, (6-10) 

•'—00 

and in (6-7) 

f ( fix,y) dx dy = 1. (6-11) 

J —-00 J ~.00 

Any integrable function fix) or fix,y) which satisfies these conditions 
(6-9) to (6-11) may be regarded as a probability density. The sample 
space is infinite; it consists of the events x = xq for every choice of Xq 
or ix,y) = ixo^yo) for every choice of (xo,?/o)- 

For example, if f(x,y) « 1/A in a region R of area A and f(x,y) - 0 elsewhere, it is 
easily verified that (6-11) holds. The probability that (x,y) is in a subregion Ri con¬ 
tained in is 

f dx dy jjdx dy ~ 

where Ai is the area of Ri, The variable (x,y) is then said to be uniformly distributed in R. 

The theory for finite sample spaces applies with little change to con¬ 
tinuous distributions; for example, the expectation is defined by 

/ oo 

xf{x) dx 

E{x) = / rf{x) dx = - 

" / fix) dx 

J ^co 

The latter expreeeion follows from (6-10); it shows that E(x) is the x 
coordinate of the center of mass for the area bounded by the curve y « /(x) 
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and the x axis. More generally, the expected value of any function y « g{x) 
is 

r yS{x) dx, y « g{x), (6-12) 

y —Qo 

and the sum theorem E(x + y) — Eix) + E(y) is a simple consequence 
of the properties of integrals. Compare Sec. 5, Example 2. 

Two variables a:, y are said to be itidependeni if the joint density /(x,y) 
has the form 

/(^,2/) = fix)g{y). 

Tiie theorem of compound probability for independent events 
valid in the form 

Pr (a < a: < 6, c < y < (f) 5= Pr (a < a: < 6) Pr (c < 2 / < d). 

The theorem of total probability assumes various fortUs, such as 
Pr (a < X < c) ~ Pr (a < X < 6) + Pr (6 < X < c) 
for a < 6 < c. Equation (G'15) is equivalent to 

/ fix) dx I fix) dx + / fix) dx 

Ja Ja Jh 

which, in turn, is a known property of integrals.^ 

Example 1. A variablo x is said to be uniformly distributed on (a,6) is constant 
on (a,b) and zero outside (af). Find f(x) m this case. 

Denoting the constant by c, we have 

f(x) dx I c dx ^ c(b — a) »» I 

I Ja 

by (6-10). Solving for c yields 

/(■r) “ 7 -^’ a <x <b, 

0 — a 

fix) * 0, elsewhere. 

Example 2. A stick of length a is broken at random into two pieces. Find the dis¬ 
tribution function F(s) for the length 8 of 
the shorter piece. From this find the 
probability density /(/) for the length I of 
the longer piece. 

Evidently 0 < s < a/2 in every case 
For any i between 0 and a/2 we have 
8 <t if, and only if, x is on one of the 
intervals (0,0 or (a — t, a) (see Fig. 9). 
uniformly distributed, and hence 

* It is also possible to start with the theorem of total probability and deduce from this 
that probability can be represented as a Stieltjes integral (Sec, 5). Mild continuity 
conditions then give the representation (6-3). 






Fia. 9 

The probability of that is 2^/a, since x is 



(6-1.3) 
is then 

(6-14) 

(6-1.3) 
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F(.) - 

0 


n») - 0 . 

» <0, 

F(») - 1, 

a 

Since the length 2 of the longer piece satisfies 2 — o — we have 2 < < if, and only if, 
« > 0 — 2. By the result just obtained the probability is 

1 — Pr (s < o — 

0-1 

o 

for o/2 <t<a and 0 or 1 otherwise. This gives the distribution function for L By 
difiterentiation, the density is found to b« 


0 

-<l<a, 

m - 0. 

(6-16) 

elsewhere. 


The differentiation is not valid for I « a/2 or for I ^ but it does not matter how the 
density is defined at these isolated points. 

Example 3. A stick of length a is broken at random, and the longer piece is again 
broken. What is the probability that the three segments c.an form a triangle? 

Let I be the length of the longer piece. If this piece is broken at a point a:, the three 
segments are a *- a;, i x. The condition for a triangle is that the sum of any two 

segments shall exceed the third: 

a — — X, a-~x>x, I > a 

Since I > a/2 automatically, these conditions reduce to 

It Is a conceptual aid (and not incorrect) to use the theorems of total and compound 
probability in the following manner: The probability that 2 is on the interval (2,2 + d2) is 



a 


by (6-16). 


Eonce X is 
product 


After 2 is chosen, the probability that x satisfies (6-17) is 
Jl -^/2 I 


^ 0- 2 

Jl -^/2 i I 


uniformly distributed on (0,2). The probability of both these events is the 
0-22 


2 a 


dl 


hy compound probability, and total probability now gives the final answer: 
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Mxampls 4. Needk Problem. A needle of length a is dropped on a board 

which is covered with parallel lines spaced a distance 6 > o (Fig. 10 ). What is the 
probability that the needle intersects one 
of the lines? 

We assume that the variables x and $ 
of the figure are uniformly distributed, 

X being the distance from the center to 
the nearest line. There is intersection if, 
and only if, l(o/2) cos S] > x. For fixed 6 , 
the probability of this is 

I (a/ 2 ) cos 0 1 ojcos^l 

r /2 b 

since x is uniformly distributed on ( 0 ,h/ 2 ). 

(Jsing total and compound probability as 

in Example 3, we obtain the final answer: 1^1 



I 


alcos 01 


2r 


oJL 

b2ir 


prl2 

4 cos ff d0 
JO 


xb 


PROBLEMS 


1. A probability density is defined by f(x) « 3x^ for 0 < x < 1 and f(z) » 0 else¬ 
where. Find F(x) and F(x^), Find the distribution function F(x), and from this obtab 
a value m such that x is just as likely as not to exceed m. (The value tn is called tire 
median of x.) 

2. The radius of a sphere is uniformly distributed on (0,1). Find the expected value 
of the volume (st^e ( 6 - 12 ) |. What is the probability that the volume exceeds half its 
maximum value? 

3 . A stick of length a is broken at random into two parts. What is the expected 
length of the shorter part? 

4. Two points are chosen at random on a line of length o. W^hat is the probability 
that the three segments can form a triangle? 

6 . The probability density for bullets hitting a target is given by 


fM 




where vx, Vy, mx, my are constant. Sketch the curves of constant density in the xy plane, 
What kind of curves are they? 

6 . We make two independent observations xi, X 2 of a variable with distribution func¬ 
tion /(x). What is the probability that a third independent observation X 3 will fall 
between xi and x%7 Generalisse to n observations. Hint: Use the methods of discrete 
probability. 


PROBABILITY AND RELATIVE FREQUENCY 

7. Independent Trials. It often happens that the probability of an event 
cannot be determined by counting cases or by other a priori considerations. 
Sometimes the determination is impossible in principle; for instance, 
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one cannot compute the probabilities associated with a loaded die or the 
probability that a given radio tube will fail in the first hundred hours’ 
use. Sometimes the determination is theoretically possible but impractical 
For instance, by examining every nail in a 100-lb keg one could find the 
probability that a nail selected at random will be defective, but this is 
not a useful method. 

In many such cases an estimate for the probability can be obtained by 
repeated trials (or by inspecting a suitable sample, in the terminology of 
statistics). In the case of a biased coin, for example, if 10 tosses give 7 
heads, 100 tosses give 73 heads, and 1,000 tosses give C90 heads, it appears 
that *‘the probability of heads is prolmhhj close to 0.7.'’ The two italicized 
w^ords express a reservation which is always present in conclusions such 
as this. 

The figures 7, 73, 690 in the above discussion represent the frequency 
of heads; the ratios 

7/10, 73/100, 690/1,000 

give the relative frequency in 10, 100, or 1,000 trials. More generally, 
if an event occurs m times in n trials, the relative frequency is m/n. 

The trials in a sequence of trials are said to be independent if the proba¬ 
bilities associated with a given trial do not depend on the results of pre¬ 
ceding trials. For example, the probability of heads on a given toss of a 
symmetric coin is no matter what is known about the results of previous 
tosses. But if we try to get an acc by drawing cards one at a time without 
replacing, the trials are dependent. In this case, the probability of ace 
in a given trial depends on the number of aces that may have been drawn 
previously. 

Wlien an event has constant probability p of success, the probability 
of ni successes in n independent trials may be computed as follows. A 
sequence of m successes and n — m failures is represented by a sequence 
of m letters S and n — m letters F: 

SSFFSS ... SF. (7-1) 

Since the trials are independent, the probability of any one such sequence 
is 

ppqqpp pq^ p^q^^^, (7-2) 

where ^ 1 p. To obtain the numl>er of favorable sequences, observe 

that a sequence is determined as soon as the positions of the m letters S 
axe fixed. The m places for these letters S can be chosen from the n places 
in nCm ways, and hence the required probability is 

by the theorem of total probability. 


(7-3) 
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Alternatively, the reader may imagine a sample space in which each event consistis of 
a sequence (7*1), with associated measure (7-2). Tlaen (7-3) represents the sum of the 
measures of those points favorable to the event: m successes. 

Replacing mhy x gives 

B{x) « “ 

x!(n ~ x)! 

for the probability of exactly x successes in n independent trials with 
constant probability p. The associated distribution function is 

F(t) - nCoq^ + nCipq^^^ + ’ ‘ • + nCtvY^^ (7-5) 


for integral values of L This expression gives the probability of getting 
at most t successes in n trials. 

Because of its connection with the binomial theorem (Prob. 6), the 
function B{x) is called the binomial frequency function^ F{t) in (7~5) is the 
binomial didribution^ and the statement that B{x) gives the probability 
of X succf'sses in n independent trials is called the binomial law of probor 
hihiy. Since many statistic'al studies involve repeated trials, the binomial 
law has great practical importance. 

To illustrate the use of the formula (7-4) let it be required to find the 
probability that the ace will appear exactly 4 times in the course of 10 
throws of a die. Here p = 3^, g — n — 10, x = 4. Hence the proba¬ 


bility is 


^( 4 ) 


10! /ly /5y 

41GIW W 


0.05427. 


Since the expected number of successes in one trial is p, the expected 
number in n trials is 

h{x) == np (7^6) 

(compare (5-2)]. For most distributions there is no special relation between 
the expected value and the most probable value, but for the binomial dis¬ 
tribution they happen to be almost equal. Equation (7-4) yields 

B{x -f 1) (n — x)p 

B{x) (x + l)g 

after slight simplification. Hence B{x) is an increasing function of the 
integer x if, and only if, 

(n — x)p 

- - > 1 . 

(x + l)g 

The latter inequality is the same as 

(n — x)p > (x 4- l)g 

which reduces to np > x + g, since p + g ^ 1. We have shown, then, 
that B{x + 1) > B(x) as long as x < np — g but B(x 4- 1) < i5(x) there- 
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After. Since < 1, tim establishes that S{x) is maxunxim for a value of 
X which is within 1 of the value x » np. Further discussion of the func¬ 
tion Bix) is given in the following sections. 

Example 1. Ten tosses of a suspected die gave the result 1,1, 1,6,1,1,3,1,1,4. What 
is the probability of at least this many aces if the die is true? 

The event ‘^at least 7 aces” can materialke in four mutually exclusive ways: 7 aces, 
8 aces, 9 aces, 10 aces. By total probability (or by use of the distribution function) the 
required answer is found to be 


B(7) -f B(8) *f B(9) + B(10) 

“ ioC7(K)’(^® + + iMVefm + ioCio(M)'® 

when we take p «« n «» 10. This reduces to 0.00027, approximately. Because the 
observed result has such small probability, one would reject the hypothesis ''p *» 
unless there is some other evidence in its favor. 

Example 2. In Example 1 let p be the unknown probability of the ace in a toss of 
the die. (a) For what value of p does the expected number of aces agree with the ob¬ 
served number? {})) For what value of p is the probability of the observed result a maxi¬ 
mum? 

Since E{x) np by (7-6), the observed and expected numbers agree when p « x/n, 
that is, when p » 0.7. The estimate for p given by p ** x/n is called an unbiased esti¬ 
mate, because E(x/n) p. 

For part (6), the probability of getting 7 aces and 3 other numbers is 

or laCip'^q^, g « 1 - p. 


depending upon whether the order is considered or not. In either case the probability 
is maximum when p^(l — p)* is rnaxunum. This, in turn, is maximum when 

log p\l - p)* 7 log p + 3 log (1 - p) 

is maximum. Differentiation gives 

p 1 ~ p 


or p « 0.7. An estimate for p sucih as this, which maximizes the probability of the ob¬ 
served result, is called a maximum likelihood estimate. 


PROBLEMS 

1 . When 5 coins are tossed what is the probability of exactly 2 heads? At least 2 
heads? What is the expected number of heads? The most probable number of heads? 

2. If 6 dice are tossed ^simultarleouflly, what is the probability that (a) exactly 3 of 
them turn the ace up? {h) At least 3 turn the ace up? 

3 . If the probability that a man aged sixty will live to be seventy is 0.65, what is the 
probability that out of 10 men now sixty at least 7 will live to be seventy? 

4 . A man is promised $1 for each ace in excess of 1 that appears in 6 consecutive 
throws of a die. W^hat is the value of his expectation? 

6. A bag contains 20 black balls and 15 white balls. What is the chance that at 
least 4 in a sample of 5 balls are black? 

0 . (a) By use of the binomial theorem show that 
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(g + ptr - Bm -f B{1)^ -f- B(2)fi + • - + B{n)t\ 

(h) Interpret the I entity which arises when ^ « 1. (<?) Differetitiate with respect to t, 
and interpret the identity which then arises for f ■» 1. [The function (q -f pi)** is 
called the generating functim of the sequence {i5(a;)).] 

7. (a) One hundred light bulbs were tested for 600 hr, at the end of which time 57 
bulbs had failed. Obtain an unbiased estimate and also a maximum-likelihood esti¬ 
mate for the probability of failure in 500 hr. (6) Are these two estimates of p always 
equal for the binomial distribution? Hint: In (5), compare the result of maximizing 
pm^n-m regp^^ct to p and the result of choosing p so that E{x) * w, where m is the 
number of observed successes. 

8. In a certain agricultural experiment, the probability that a plant will have yellow 
flowers is If 10,000 plants are grown, what is the probability that the number with 
yellow flow'crs will be between 7,400 and 7,000? (To appreciate latter developments ob¬ 
serve that your answer, which should be indicated only, is difficult to compute.) 

8. An Illustration. Some interesting conclusions concerning the bino¬ 
mial law are suggestcid by an example that presents many features of the 
general case. Consider a purse in which are placed 2 silver and 3 gold 
coins, and let it be required to find the probability of drawing exactly x 
silver coins in n trials, the coin being replaced after each drawing. The 
probability of exactly x successes in n trials is given by (7-4) where p, 
the probability of drawing a silver coin in a single trial, is %. If the 
number of drawings is taken as n = 5, 10, or 30, the respective frequency 
functions B(x) are 

B(x) = 6Cx(%)*(%)®-*, n = 5, 

B(t) = n = 10 , 

B(x) = n = 30. 

By use of these expressious one can compute the values of B{x) to any desired accu¬ 
racy. The result of such a computation to four places of decimals is presented in the 
accompanying tabh's. In the third table the entry 0.0000 is made for 0 < .t < 2 and 
for X > 23 because in these cases B(x) was found t^o be less than 0.00005. For example, 
the probability of drawing exactly 23 silver coins in 30 trials is 

B(23) » = 0.000040128. 

The reader can verify that the most probable values of x are exactly equal to np 
(and not merely within 1 of np). This behavior is always found when np is an integer, 

pROBABILITT OF EXACTLY X SuCCESSKS IN 5 TrIALS 


X 

i 

B(z) 

X 

B(x) 

0 

0.0778 

3 

0.2304 

1 

0.2592 

4 

0.0708 

2 

o.$4^e 

5 

0.0102 
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PBOBABItITY OF EXACTI^Y X ScCCBBaBS IN 10 TrUI#8 


X 

Bix) 

X 

B{x) 

X 

B(x) 

0 

0.0060 

4 

0 . B 60 S 

8 

0.0106 

1 

0.0403 

5 

0.2007 

9 

0.0016 

2 

0.1209 

6 

0 1115 

10 

0.0001 

3 

0.2150 

7 

0.0425 

1 




Probability of Exactly x Succesbbs in 30 Trials 


X 

B(i) 

1 

X 

B{x) 

X ! 

B{x) 

<2 

0.0000 

9 

0.0823 

16 

0.0489 

3 

0.0003 

10 

0.1152 

17 

0 0269 

4 

0.0012 

11 

0.1396 

18 

[ 0 0129 

5 

0.0041 

12 

OAm 

19 

0.0054 

6 

0.0115 

13 

0.1360 

20 

0,0020 

7 

0.0263 

14 

0.1100 

21 

0.0006 

8 

1 0 0505 

15 

0.0783 

22 

0.0002 





>23 

0.0000 


The values given in the tables are presented graphically in Fig. 11 after 
the manner described in See. 5. Each curve has the general shape pre¬ 
dicted by the theory of the preceding section, but the figure shows also 
how the shape changes as we proceed from one curve to another. The 
numerical area under each curve is 1, although the curves become broader 
and flatter as n increases. In particular the maximum (that is, the proba¬ 
bility of the most probable value) decreases as n increases. This is just 
what one would expect intuitively. (For instance, one could easily get 
2 heads in 4 tosses of a coin, but one would be surprised to get exactly 
500,001 heads in 1,000,002 tosses.) The fact that the curves become 
broader indicates that the values of x experience a wider spread when 
there are more trials, and this, too, one would expect. Naturally, the 
curves ought to get broader if the maximum is to decrease while the area 
remains equal to 1. 

The foregoing discussion is concerned with the frequency of success in 
n trials. The results are very different if, instead, one considers the relative 
frequency x/n. The distribution for the variable x/n is presented graphi¬ 
cally in Fig. 12. These curves were obtained from the preceding by the 
change of scale indicated on the axes, and hence, the area is still 1. Instead 
of becoming broader, these curves become narrower as n increases. The 
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relative frequency xjn tends to cluster about its expected value p bb n 
gets large. It is for this reason that relative frequency can be used to 
estimate an unknown probability. 



Fia 11 

The behavior suggested by this example may he summarized as follows. 
When the number of trials n becomes large, the absolute deviation from 
the expected value 

\x — np| — |x — E(x) I 

iwB(x) 



Fig, 12 

is likely also to be large, but the relative deviation 



is likely to he sinall.^ 

^ It will be seen in Sec. 9 that the first expression is usually of the order \/n and the 
second, of order l/y/n\ compare Prob. 3. 
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PROBtEMS 

1. Plot a distribution curve like that of Fig 11 for the probability of x successes in 4 

trials when p Shade the area corresponding to the event 1 < a; < 3, and find 

the probability of this event. 

2. For p « M plot the probability of the most probable number of successes versus n. 

(Take points at w » 1, 2, 3, 4, 5^10, 30 only, cf. Prob, 1 and accompanying tables.) 
On the same figure pl ot ll\ /2^pq versus n. (It is shown in Sec. 9 that the probability 
i« asymptotic to when n is large. This expression appioachos zero as n —► <», 

even though we are considering the mmt probable value ) 

3. Using the tables and your numerical values in Prob. 2, plot y/n B(x) versus 
(x — np)fy/n for p » % and for n « 3, 4, 5, 10. l^se the same scale in each case. 
Formulate a conjecture concerning the behavior as n -+ oo, and test your conjecture 
by plotting five well-chosen points on the curve corresponding to n *« 30. 

9. The Laplace-de Moivre Limit Theorem. Numerical computation of 
the binomial distribution is difficult when n is large. In this section an 
approximate formula is obtained when n and np are both large. In See. 11 
a formula is found when n is large but np is not large. Tfiese approxima¬ 
tions, together witli the exact formula when n is moderate, cover all cases. 

The analysis is based on the Stirling formula, 

n! ~ n-e-" V2^, (9-1) 

wliieh is made plausible by the following dLs~ 
cuasion. Consider the function y « logx, 
and observe that for k >2, 



f 

Ji~i 


log xdx > Hllog (A - 1) + log k], 


log a;. 


since the right-hand member represents the 
trapezoidal area formed by the chord (Fig. 13) 
joining the points P and Q on the curve 
Denote the area between the chord and the curve by a*, so that 


r log xdx 
-1 


VzWogik ~ 1) -f log/c] +afc. 


(9-2) 


Setting k — 2, 3, ..., n in (9-2) and adding give 

j log X dx * H(log I 4- log 2) 4- HQog 2 4* log 3) d- 

4- (log (n - 1) -h log n] 4* (02 4* 08 d-h On). 

Integrating the left-hand member and combining the terms of the right-hand member give 

n 

n log n - n 4“ I “ log nl - H log n -f X) 

*««2 


log n! - (n -h J4) log n — n 4-1 — 2 o<. 


Senoe, 


(M) 
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Since each Oi k positive, it follows that 

log n! < (n -f H) log n — n 4* 1 

and hence 

n! < eVnn**e~". (9-4) 

The expression on the right of the inequality (9-4) is, therefore, an upper bound for nl. 
To get a lower bound, solve (9-2) for Oji, perform the integration, and obtain 


ak 




Now, since the integrand is nonnegative, 


r (---V 

Jk-i \x k/ 


dx>0 


and the evaluation of (9-6) leads to the formula 


log- 


2k 


/c - 1 2k{k - 1) 
By use of this inequality, (9-5) gives 

1 

ak <- 

4k{k - 1) 4 ^ 


4 Vfc - 1 kJ 



(9-6) 

(9-6) 


By means of this result and (9-3), one obtains 

log n! > (w + H) log n — n -f- 1 — 

whence n! > e^^\/n (9-7) 

Combining (9-4) and (9-7) furnishes the inetjuality ^ 
n < n’ < e^/n 


for all values of n > 1. Since e » 2 718, =* 2 117, and ■\/'2tc «= 2.507, we have shown 

that (9-1) is correct as to order of magnifude. More refined methods establish that the 
error is less tlian 10 per cent for n > 1, less than I per cent for n > 10, and less than 
0.1 per cent for n > 100. Moreover, the percentage error approaches zero as n -+ oo, 
so that the equality is asymptotic. 

In the expression 

B(r) (9-8) 

r!(n — r)! 


for the probability of r successes in n independent trials, we assume that 
r, n, and n — r are large enough to permit the use of Stirling's formula 

* The derivation of this result is given by P. M. Hummel, Am. Math, Monthly^ 47:97 
(1940). 
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(9-1). Replacing nl, r!, and (n — r)! by their approximations gives, after 
simplification, 




t) 


(9-9) 


Let & denote the deviation of r from the expected value np; that is, 

6 = r — np. 
n — r = ng — 5 

(np4-5) / 5) 

nqj 


Then, 

and (9-9) becomes ^ 


B(t) 


or 


where 


( e \ ~(np+g) / P V 

i+_) (i--) 

np/ \ nq/ 


\ np/ \ nq/ 


Then, log B(r)A ^ — (np + 6) log (1 H-) — {nq — 6) log ( 1 —) • 

\ np/ \ nq/ 


Assuming 151 < npq^ so that 


b 

< 1 

and 

8 

np 



nq 


< 1, 


permits one to write the two convergent series 


and 


Hence, 


log (1 + = 

\ np/ 

log(l --) = 
\ nq/ 


d 

np 


+ 


5^ 


2n^p‘‘ Sny 


3^3 


nq. 

log B{r)A ^ — 


b b^ b^ 
nq 2n^(f 3n^q^ 

_iL _ - 9^) 

2npq 2-Zn‘p^^ 


3-4n®p*g® 


Now, if I i I is 80 small in comparison with npq that one can neglect all 
terms in this expansion beyond the first and can replace A by 

1 Here and in similar cases which arise subsequently, we assume that p 0 and 
q 3 ^ (k The cases p « 0 or p « 1 can be dealt with by inspection. 
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thcD there results the approximate formula 


Bir): 


— e 


4*/2nji« 


(9-10) 


\/ 27rnpq 

which bears the name of Laplace^s, or the normal^ approximation. With 
cr = \^npq , Eq. (9-10) becomes 

1 


B(r): 


\/2ir a 


p-«*/2<r2. 


(9-11) 


The equalit}^ is asymptotic; that is, the ratio of the two sides tends to 
1 as 7? A comf)arison of B{r) with the normal approximation is 

given in Fig. 14. 



The main usefulness of this result is to compute the probability 

E B{r) (9-12) 

r«.ri 

that the number of successes is between the given limits ri and r 2 . Equa¬ 
tion (9-11) shows that the sum (9-12) may be approximated by a sum 

S (9.13) 

V2ir a 


over appropriate values of 5. Since 5 = r — np, the difference between 
successive values of 5 is 1, and hence if we let t — 8/cr^ the difference be¬ 
tween successive values of / is M = 1/cr. Thus (9-13) becomes a sum over b 


\/^ 


-<V2 


AL 


(9-14) 
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As M --4 0, the expression (9-14) approaches an integral, which may be 
evaluated in terms of the function 

1 








(9-16) 


tabulated in Appendix D. These considerations yield the following funda¬ 
mental result, known as the Laplace-de Moivre limit theorem: 

Theorem. Let x be the number of successes in n independent trials with 
constant probability p. Then the probability of the inequality 


X np 

t\ ^ ^ ^2 

wnpq 

approaches the limit 

#(/ 2 ) - Hh) 

V 2ir •'<1 

as n 00 . 


(9-16) 

(9-17) 


To complete the proof one must note that the error in passing from (9-12) to (9-13) 
ifi small for large n even when the number of terms in the sum is large A more de¬ 
tailed analysis, taking due account of this (question, is given in William Feller, ‘Troba- 
bility Theory and Its Applications," pp, 133-137, John Wiley <fe Sons, Inc., New York, 
1950, It is shown tliat a better approximation is given by 




Vnpg, 


(9-18) 


although the improvement is not important when n is large An expression for the 
error in the approximation is derived in J V. Uspensky, “Introduction to Mathematical 
Probability," p. 129, McGraw-Hill Book Company, Inc , New York, 1937 


To illustrate the use of the result (9-17), let us find the probability that 
the number of aces will be between 80 and 110 when a true die is tossed 
600 times. Here n = GOO, p - K, g = ?6; and x varies from 80 to 110. 
Hence 


80 - 100 

= -~jr==:==r— = —2,19 and 

v(100)(%) 


_ JllO - 100 

" V(iTOj(%) 


1.09. 


The table gives 4>(f2) ~ 4>(1.09) - 0.362, and similarly 


<I)(-2.19) = -4>(2.19) = -0.486. 

[Observe that 4>(—0 = —^(t), since the curve y == is symmetric.] 
Hence the required proba))ility is, approximately, 

0.362 - (-0.486) - 0,848. 

Example 1. In the notation of the text, the probability P^ax of the most probable 
value of r satisfies 

^ Pjnxuc 


when n is large. 


^/2irnpq 


(9-19) 
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In Sec. 7 it was found that the most probable value is of the form 

f «* np 4- 1^1 < JL 

For this value of r we have 

5 a» r — np » 

and hence Eq. (9-10) shows tlmt the associated probability is asymptotic to 


__ 

\^2wnp(2 

As n —♦ 00 , the exponential tends to 1, since 9^ is bounded, and this yields (9-19). 

Example 2. In an agricultural experiment Mendelian theory yields a probability 
p « ^ that any given ))lant should have blue flowers. Out of 10,000 plants it was 
found tliat 2,578 had blue flowers. Does this result contradict the theory? 

According to theory, there should have been 2,500 plants with blue flowers; that is, 
the expected number is np « 2,5(X). There were, in fact, 78 more than this. We have 
to decide if this excess is too large to be attributed to chance. 

Let us find the probability that the excess will be 78 or more if the hypothesis p w 
is indeed correct. The inequality 

78 < a: - np (9-20) 


becomes 

when divided by V^^PV 


r — np 

1.801 < < 00 (9-21) 
Wnjyq 

43.3. According to the table the probability of (9-21) is 


<I>(a>) <l>(1.80l) = 0 500 - 0 464 « 0.036, 


Now, in a statistical test it is customary to reject the hypothe.ms if the hypothesis makes 
the probability of the observed result i(‘NS than a fixed (juantity a deterinmed before¬ 
hand The value a (which is called the mqtuficance levd of the test) is oftem taken to be 
0 05. Since our probability 0 036 is less than 0 05, the experimental outcome is con¬ 
sidered too unlikely to be attributed to chance, and we reject the hypothesis ^‘p «* 

In this siuisc, the exf>eriment contradnds JMendelian theory. 

We now give another analysis which leads to the opposite conclusion. Instead of 
saying ‘The excess was 78,” one could just as wadi say, “the discrepancy w as 78,” meaning 

|a:-npl- 78. (9-22) 

Both statements are equally valid descriptions of the experimental outcome. The 
probability of 

I a: — np I < 78 

is found, as above, to be 

- ‘l»(-1.801) « 0.928, 

and hence the probability of the contrary event is 

1 - 0 928 « 0.072. 


Since 0.072 > 0.05, a discrepancy of “78 or more” is sufficiently probable to be at¬ 
tributed to chance (if, as before, our significance level is 0.05). Hence the hypothesis 
is not contradicted by the experiment.* 

* When the probability exceeds the significance level, as in this case, the hypothesis is 
not thereby proved but it is considered to have withstood the exjf>erimental test. 
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It requires statistical methods of considerable subtlety to decide between competing 
tests of a hypothesis such as the foregoing. These methods show that the fii'st procedure 
is appropriate for t^Ksting the hypothesis ^‘p against the alternative > W' 

whereas the second is appropriate for testing the hypothesis against the alternative 
^ K-*’ A very readable account of the subject is given in P. G. Hoel, '‘Introduction 
to Mathematical Statistics,” chap. 10, John Wiley <fe Sons, Inc., New York, 1954. 


PROBLEMS 


1. Two dice are tossed 1,000 times. What is, approximately, the probability of get¬ 
ting a sum of 4 the most probable number of times? 500 tina^s? (Tso a table of ex¬ 
ponentials.) 

2. A true coin is to be tossed 1,600 times, and it is desired to find the probability that 
the number of heads x will satisfy 780 < a* < 830. (a) Show that this inequality is 
equivalent to 


-1 



< 1.5. 


(6) Express the probability of the latter inequality in terms of 4> by means of the normal 
law. (c) Using the table, evaluate the probability. 

3. By means of the normal law, obtain an approximate numerical answer to Prob. 8, 
Sec. 7. 

4. A machine has a probability p *= 0 01 of producing a defective bottle In oia^ 
day's run, out of 10,000 bottles, 120 were defective, k'ind the approximate piobability 
of at least this many defectives if the machine is running as usual 

6. A suspected die gave only 960 aces in 6,000 tosses. If the die is true, (a) \\ hat is 
the probability of getting at most 960 aces in 6,OCX) to.sses? {b) What is the probability 
of getting a discrepancy \x — np| of “40 or more”? (c) At a significance level of 0.05, 
does either calculation indicate that the die is loaded? 


10. The Law of Large Numbers. Since 2 B{r) = 1 for each value of n, 
it is natural to expect, by the foregoing analysis, that 


1 

\/2t 



1 . 


(10-1) 


For a direct proof of (10-1), define I by 


1 = f 

J — 00 —00 


( 10 - 2 ) 


Then multiplication of the two expressions (10-2) yields 

p = r r (10-3) 

J — oa J —00 


after changing to a double integral. In polar coordinates, 


n oo /*Q0 

dr d6 = 2ir {■p/2) =* 2v (10-4) 
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SO that I ~ ■\/27r, and (10-1) follows. The transformations leading to 
(10-3) and (10-4) are justified by the fact that (10-3) is an absolutely con¬ 
vergent double integral. 

Equation (10-1) shows that the function 


1 


r = 4>(0+- 

j —00 9 


is a distribution function; it is called the normal distribution. The theorem 
of the preceding section asserts that the variable h/or is approximately 
nonnally distributed when n is large. This fact will now be used to es¬ 
tablish the following fundamental result, which is a special case of the 
so-called law of large numbers: 

Tiikokem. Let x be the number of successes in n independent trials with 
constant probability p. If t is any positive number, then the probability of the 
mequality 

k I 


- - p 

n 


< € 


(10-5) 


tends to 1 as n —> oo. 

In other words, the relative frequency of the event is almost sure to be 
close to the probability of tlie event when the number of trials is large. 
For proof, write the inequality (10-5) in the form 


which becomes 


X — np 

-€ < -< € 

n 



( 10 - 6 ) 


when multiplied by \/7 //pq Cliven any number U) (no matter how large), 
we can choose n so that ty/n^pq > In this case the probability of the 
inequality (10-6) is at least equal to the probability of 


As n 


X — np 

— ^0 ^ 7 ”—“ ^ 0 * 

V npq 

oo, the latter probability ^ tends to 


1 

y/2Tr 



(10-7) 


( 10 - 8 ) 


by (9-17). Since to is as large as we please, Eq. (10-1) shows that the in¬ 
tegral (10-8) is as close to 1 as we please, and this completes the proof. 

' One must not apply (9-17) directly to (10-6), because (9-17) was obtained only for 
fixed ii and h whereas the limits in (10-6) depend on n. 
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Tlie theorem was established first by James Bernoulli (1654-1705) 
after 20 years of effort. The law of large numbers lies at the basis erf all 
attempts to estimate a probability experimentally, and it affords a phil¬ 
osophical justification for such attempts. In fact, some developments of 
the subject define probability in terms of relative frequency, by the formula 
p := Urn {x/n) as n Qo, and rely on the law of large numbers to ensure 
that the limit exists. 

The theorem makes possible some interesting computational procedures, 
known as Monte Carlo methods. Although the method is not to be discussed 
at length here, we sketch an example that illustrates some of the main 
features. Suppose a man walks in a straight line, taking a step of length 
h ft every s sec (see Fig 15). Each step is equally likely to be to the right 

h 

Fig. 15 

or to the left, vrithout regard to the preceding steps. Assuming that x 
is a multiple of h and t is a multiple of s, it is required to find tlie probability 
that the man is x ft from his starting point at time L 

Let U{xd) stand for the probability in question; that is, U{xJ) is the 
probability of the man’s being at point x at time t if he was at point .r = 0 
at time t — 0. Now\ lie cfin arrive at point x at time ^ + s in two ways. 
Either he was at point x + h at time / and took a step to the left, or he 
was at point a: — /i at time t and took a step to the right. The probability 
of being at x + at time t is U{x + h, t) by the definition of f/, and the 
probability of a step to the left is Yz by hypothesis. Hence the probability 
of botli events is 

y2C{x + K i) 

by compound probability. In just the same way the probability of being 
hix ^ h and then stepping to the right is 

YiUix ^ K 0. 

By total probability, the probability of getting to the point x at time 
t + sis the sum, and w^e are thus led to a difference equation for C/, 

f/(x, t + s) = YU{x + h,t) + YU{x - h,t). (10-9) 

The boundary conditions are 

uir,0) =0 foTx 9^0, x; U{x,t) =. 1 (]0-9o) 

X 

which express the fact that he is sure to be at the origin when t =« 0 and 
sure to be at some point x for all t 
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To apply the Monte Carlo method to this problem, we make a large 
number of actual random walks experimentally. The number of times 
we arrive at point x at time i gives an estimate for the probability U{x^i) 
by virtue of the law of large numbers. Hence, the calculation yields an 
approximate solution of the problem (10-9) without any direct use of 
(10-9). In practice, the ‘^random walks'" are made on a computing machine 
by reference to a set of random numbers. Similar methods apply to 
difference equations of much greater complexity than (10-9). 


For readers familiar with the theory of heat conduction the foregoing example yields 
an interesting interpretation ^ of the normal law. Subtracting U{x,t) from both sides 
of (10-9) and dividing by s give 

f + «) ■“ V{x,i) rf^(^ 4* A, 0 - 2lJ{x,i) -f V{x - K 

■ s "' 2 a 


If we set s ^ and let h 


with boundary condition 

r/(j,0) 


0, this becomes, formally,* 

W 1 dHJ 
dt "" 2 dx^ 


0 for X 0, 



1 . 


(10-10) 

(lO-lOa) 


Since these are the conditions for an inst^intaneous source of heat at the origin, a solution 
ifl • 


Vix^t) 


1 

V27t 




(10-11) 


Now, in the random walk the probability of a steps to the right and h to the left is 
given approximately by the normal approxiination (9-10); it turns out to be 




7r(a + b) 




( 10 - 12 ) 


If the man arrives at point x at time he makes t/s steps altogether and x/h more steps 
to the right than to the left: 


d b 


Substitution in (10-12) and setting 5 » yield 

1 


(/(x,0 ^ 


•\/ 2rrt 


-^f-\2h) 


(10-13) 


for the probability. Here 2h is the disianc<‘ between possible values of x when i is fixed, 
and hence the coefficient of 2h may bo regarded as a probability density. The condition 

^ Since heat is due to random motion of the molecules, the analogy of the random-walk 
problem with the problem of heat flow has a physical basis as well as the mathematical 
basis outlined in the text. 

* See Chap. 6, Sec. 26. 

* See Chap. 6, Beo. 19. 
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*‘h amair’ means simply that the number of steps is large, so that the normal law is ap^ 
pUcable. The analogy between (10-11) and (10-13) is evident. 

The discussion shows that not only (10-9) but the problem in heat flow given by 
(10-10) may be attacked by making random walks. Some of the main applications of 
Monte Carlo methods are, in fact, to the study of partial differential equations. 

Example: A true coin is tossed repeatedly. It is desired to have a probability of 0.99 
that the relative frequency of heads shall be within 1 per cent of the probability of 
heads. How many times must the coin be tossed? 

If the coin is tossed n times, the desired inequality is 





(10-14) 

which is the same as 


1 p 1 



-0.01 ^ 

y/npa ^7 

(10-15) 


Setting the probability of (10-15) equal to 0 99 and noting that p = g, we get 
0.99 = <t>(0 01\/w) - ^(-0 01\/?i) « 2<I>(0.01\/^) 
by the normal approximation. The table gives 

OOlVn =■ 2.58, 

SO that n »» 07,000 approximately. The fact that a problem such as this will always 
yield a finite value for n is the essential content of the law of large numbers. Apfilying 
the law of large numbers in another fashion, we can interpret the result more or less as 
follows: If the whole coin-tossing experiment is repeated a great many times, in about 
99 per cent of these experiments the inequality (10-14) \\'ill bo verified. 

PROBLEMS 

1. In the Example of the text, how many timt‘S must we toss the coin to make the 
probability 0.95 that the relative frequency is within 5 per cent of the‘ probability? 

2. On the average a certain student is abU* to solve 60 per c(‘nt of the problems as¬ 
signed to him. If an examination contains 8 problems and a minimum of 5 problems 
is required for pa.ssmg, what is the student’s chance of passing? Hint Because of the 
law of large numbers, you may take the statement about the student’s average per¬ 
formance to mean: 'TIis probability of solving any given problem is 0.6.” 

8 . If Paul hits a target 80 times out of 100 on the average and John hits it 90 times 
out of 100, what is the probability that at least one of them hits the target when they 
shoot simultaneously? 

4 . If on the average in a shipment of 10 cases of certain goods 1 case is damaged, what 
is the probability that out of 5 cases expected at least 4 will not be damaged? 

ADDITIONAL TOPICS IN PROBABILITY 

11. The Poisson Law. In the problem of repeated trials it may happen 
that p is too small to permit the use of the normal approximation even 
though n is large. A different approximation, which is called the Poisson 
law or the law oj small numbers, is now to be obtained for this case. 
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Starting with the formula for the probability of r successes in n trials, 

B{r) - ~ 

r\{n — r)\ 

we replace n! and (n ~ r)! by their Stirling approximations to obtain 


B(r) 


n^e 




r!(n 


- r) 


pV - P)”- 


71 e 


r![l - (r/rt) 


- vT-'. 


(ll-l) 


Since the expected value of r is rip, we can assume that r is small compared 
with n. In this case ^ 


i'-T 


Similarly, since p is small, 

(1 - ^ (1 - p)^ = [(I - ^ 

Substituting these two expressions into (11-1) yields the desired law of 
small numbers: 

(npY _ 

B{r) ^- e n large, np moderate. (11-2) 

r! 

The result may be written 

(11-3) 

r\ 


where n ^ np is the expected number of successes. 

An application of this law to some specific cases may prove interesting. 
Suppose it is known that, on the average, in a large city 2 persons die 
daily of tuberculosis. What is the probability that x persons will die on 
any day? In this case the expected number of deaths is ^ = 2, so that 

2* 

B(x) =-e-2 
z\ 

' The reader is reminded that lim (1 -f « e as h approaches zero through posi¬ 
tive or negative values. See I, S. Sokolnikoff, “Advanced Calculus,” pp. 28-31, McGraw- 
Hill Book (Company, Inc., New York, 1939. 
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1 

X 

B(x) 

X 

B(x) 

X 

B{x) 

0 

0.135 

2 

0.271 

4 

0.090 

1 

0.271 

3 

0.180 

5 

1 0.036 


The Poisson law has a significance far l)eyond its connection with the 
binomial distribution, as will now be shown. Suppose points are dis¬ 
tributed at random on the x axis in such a fashion that the following 
assumptions are valid : 

1. The probability that a given number of points is in a given interval 
depends only on the length of that interval (and not on any information 
we may have about the points in adjacent intervals). 

2. If P(Aa:) is the probability of 2 or more points in an interval of 
length Ax, then P(Ax)/Ax 0 as Ax 0. 

3. If Pi (Ax) is the probability of 1 point in an interval of length Ax, 
then Pi (Ax)/Ax —> /c, a constant, as Ax —^ 0. 

In tins case the probability Pn{x) of n points in an interval of length x 
satisfies the Poisson law 

P„(x) - (11-4) 

nl 


To prove this result, consider an interval (0, x 4- Ax) of length x -f Ax. We can 
have n points in this interval in three mutually exclusive ways. Either there are n 
points in x and none in Ax, or there are a •— 1 in x and 1 in Ax, or there are fewer than 
n — 1 in X and at least 2 in Ax. The probability of this last alternative may be written 
c Ax, where « 0 with Ax, in view of assumption 2. 

Thus, by total and compound probability, 

P«(x 4- Ax) « P„(x)Po(Ax) 4- Pn~i(x)Pi(Ax) 4- « Ax. 


Subtracting Pn(x) from both sides and dividing by Ax give 


Pnjx 4- Ax) - Pn(x) 
Ax 


Pn(x) 


Po(Ax) — 1 
Ax 


4* Pn~l(x) 


Pi(Ax) 

Ax 


4*«. 


(11-5) 


Since there must be no point, 1 point, or more than 1 point in an interval of length 
Ax, We have 


which gives 


Fo(Ax) 4- Pi(Ax) 4- P(Ax) - 1 
Po(Ax) - 1 Pi(Ax) P(Ax) 

. OB —• .. .. —» .... — , 

Ax Ax Ax 


( 11 - 6 ) 
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Taking the limit Ax 0, we obtain —A; in (11*^), and hence taking the limit in (11-6) 
gives 

£ P«(*) - -kPnix) + fcPn-i(x), n 2; 1. (11-7) 

For n » 0 the term Pn-i(x) is to be replaced by zero, so that 

j-Po(x) - -fePo(*). 
ax 

This separable differential equation yields 

Po(x) « « 6--** 

where the constant c ** 1 since Po(0) w 1; that is, an interval of zero length is sure to 
contain no points. (This follows from assumption 2.) 

Substituting Po(x) in the relation (11-7) for n «• 1 we get 

£pi(*) - -fePi(i) + le-** 

which yields Pi(x) » e^^{kx). Proceeding step by step or using mathematical induc¬ 
tion, wc obtain (11-4). 

The following are some of the phenomena which satisfy the assumptions 
1 to 3 quite accurately and which, accordingly, obey a Poisson law: the 
distribution of automobiles on a highway, the distribution of starting 
times for telephone calls, the clicks of a Geiger counter, the arrival times 
for customers at a theater ticket office. The first example is a spatial 
distribution, while the last three refer to distributions in time. 


Example 1. What is the probability that the ace of spades will be drawn from a deck 
of cards at least once in 104 consecutive trials? 

This problem can be solved with the aid of the exact law (7-4) as follows; The proba¬ 
bility that the ace will not be drawn in the 104 trials is 

P(0) « “ 0.133 

and the probability that the ace will be drawn at least once is 1 — 0.133 *» 0.867. On the 
other hand, Poisson’s law (11-2) gives for the probability of failure to draw the ace 

5(0) - e-’oHs e-\ 

0! 


Hence, the probability of drawing at least one ace of spades is 1 — e""* «* 0.865. 

Example 2. Show that the constant k in the Poisson law (11-4) represents the ex¬ 
pected number of points in a unit interval. 

Since the probability of n points in a unit interval is 


Pn(l) 



00 


M{n) - X; e-* 

n—1 


n! 


n 


- e-**z; 


ft"-* 

(n - 1)1 


- «-**«* - k. 


the expected number is 
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PROBLEMS 

1 . By use of the Poisson law compute the probability of (a) just one ace in 6 tosses of 
a die, (b) just one double ace in 36 tosses of a pair of dice. C/Ompare the binomial law 
for cases (o) and (fe). Which of the two cases satisfies the assumptions of the text more 
exactly? 

2 . The probability is 0.0025 that a nail chosen at random from the output of a cer¬ 
tain machine will be defective. What is the probability that a keg of 1,000 nails made 
by the machine will have at most 3 defective nails? Hint: The keg has ‘'at most 3” if 
it has 0, 1, 2, or 3 exactly. Use the Poisson approximation. 

8 . In Prob. 2 it is desired to have a probability of at least 0.95 that the keg has at 
least 1,000 good nails. How many nails should the manufacturer put into the keg? 
Hint: If he puts in n «• 1,000 -|- m nails, he wants a probability 0 95 that the number 
of defective nails will be at most m. Use the Poisson law, taking np ^ l,000p » 2.5. 

4 . On a certain one-way highway it is proposed to install a traffic signal which has a 
60«sec red interval but a long gieen interval. The speed of the cars may be taken as 
30 mph, and the expected number is 10 cars per mile of highway. Neglecting any 
effects of slowing down, find the probability that just n cars will be obliged to stop 
when the light is red. What is the probability that at most 5 cars must stop? What 
is the exfiected number that must stop? Hint Assume that the cars are distributed 
according to the law (11-4), and see Example 2. 

8 . A certain circuit can transmit 3 telephone calls simultaneously The expected 
number of incoming calls is 1 per minute, and each call lasts 3 min. What is the proba¬ 
bility of getting a busy signal? Hint: You will find the line busy if 3 calls or more have 
come in during the preceding S-min interval. Use (13-4). 

12. The Theory of Errors. In this section the methods of probability 
are used to analyze the effect of experimental errors in measurement. If 
n independent measurements give the values mj, W 2 , ...» we consider 
questions such as the following: What is the best estimate for the quantity 
being measured as determined by these measurements? What is the 
probability that this best estimate is within 1 per cent, say, of the true 
value? How much added precision is gained by increasing the number 
of measurements? 

Proceeding to the first question, let mi and m 2 be two independent 
measurements of an unknown quantity m (such as the mass of an electron, 
for instance). It is desired to find a best estimate for m based on the 
measurements mi and m 2 . To this end we denote the best estimate by 
d(mi,m 2 ) and seek to determine the function 6, Now, if both measure¬ 
ments are increased by a given amount a, it seems reasonable to assume 
that the estimate also increases by the amount a. In symbols, 

e(mi + a,m 2 + a) ^ 0 {mi,m 2 ) + a. (12-1) 

This relation is now postulated. 

Similarly, if mi and m 2 are multiplied by a fixed quantity jS, it is reason¬ 
able to suppose that the best estimate is likewise multiplied by j8. This 
requirement leads to 
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« $B{mi,m2) (12-2) 

which is also postulated. [Equation (12-2) is quite obvious when we con¬ 
sider the effect of a change of units. For instance if grains are used in¬ 
stead of kilograms as the unit of mass, we expect the estimate in grams to 
be 1,000 times as great as the estimate in kilograms,] 

Finally, since the two experiments are carried out under substantially 
identical conditions, it does not matter which experimental result is mi 
and which is m 2 . We are thus led to postulate that 6 is symmetric: 


0(mi,m2) — ^(m2,mi). 


(12-3) 


It is a remarkable fact that the best estimate is wholly determined by 
these requirements; if 6 satisfies (12-1) to (12-3), then 6 must be the arithmetic 
mean, 


e{mi,m2) 


mi + m 2 
2 


(12-4) 


To establish (12-4), regard mi and m 2 as fixed and choose a ■■ —m 2 in (12-1). There 
results 

^(mi,m2) =* m 2 -h ^(mi — m 2 , 0). (12-6) 

If this expression for is used in the left-hand member of (12-2), one obtains 

0m2 + e{^mi - 0) * /30(mi,m2). (12-6) 

Whenever mi 5 ^ m 2 , the choice /3 * l/(mi — m 2 ) in (12-6) gives 

m 2 -h ^(l,0)(mi — m 2 ) «= (12-7) 

if we multiply through by mi — m 2 And now (12-3) leads to 

m 2 + ^(l,0)(mi — m 2 ) « mi -h ^(l,0)(m2 mi) 

which implies 0(1,0) =“ Hence (12-7) yields (12-4). The case mi •* m 2 is even sim¬ 
pler; specifically, Eq. (12-5) gives 

0(mi,mi) «* mi -f 0(0,0) (12-8) 

and the choice /9 « 0 in ( 12 - 2 ) shows that 0 ( 0 , 0 ) » 0 . 

By analogy with (12-4), one generally assumes that the best value for 
three or more measurements is also the arithmetic mean. Thus, 


mi + m2 + W3 
0(mi,m2,m3) =--- 


(12-9) 


We shall now use this assumption to determine the underlying probability 
distribution for the errors of measurement. 

Let the true value of the quantity being measured be denoted by t;. 
The errors, then, are 

Xt «= m» — V. ( 12 - 10 ) 
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Since the experimental determinations are made under substantially 
identical conditions, these random variables x,- are all assumed to have the 
same probability density f(x). And since the experiments are supposed 
to be independent, the joint density for two or three variables is given 
by the product: ^ 

/(^l,X2) «/(Xl)/(X2) (12-11) 

/(Xi,X2,X8) “/(Xi)/(X2)/(X3). (12-lla) 

Our task is to determine the function f(x). 

Now, V is the true value of the quantity being measured. It is not a 
random variable, and it is not at the disposal of the experimenter. Never¬ 
theless, one can contemplate the effect of a change in v, and in particular, 
one can consider that value of v which would maximize the probability of 
the observed result. We now postulate that the value of v which maximizes 
this probability is the arithmetic mean of the measurements. In other 
words, the best estimate^ (12-4) and (12-9), is assumed to be also a maximum- 
likelihood estimate. It will be found that this assumption ^ enables us to 
determine the form of the function / without any knowledge of the experi¬ 
mental process. 

If the probability (12-11) is maximum when 

V = (12-12) 


then the logarithm of the probability is also maximum. Thus 

log/(7ni - v) + log/(m 2 - ») 4- log/(m 3 - v) (12-13) 

is maximum, as a function of v, when (12-12) holds. Setting the derivative 
with respect to v equal to zero in (12-13), we obtain 

f'jmi - y) ^ fjmz - v) ^ /'(wa - f) ^ ^ 
fim - v) f{m 2 - v) finis - v) 


If is defined by 



‘ If we think of the errors as being discrete variables with / the frequency function, 
(12-11) is simply the law of compound probability for independent events. That Is, 
the probability of making an error a*i in the first experiment and X 2 in the second is the 
product of the individual probabilities. The corresponding result for continuous varia¬ 
bles and densities (stated in Sec. 6) is also a consequence of the theorem of compound 
probability. The notion of independence is discussed further in Sec. 13. 

• We shall suppose also that / is positive and twice differentiable, though these require¬ 
ments could be somewhat relaxed. 
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the foregoing result, in the notation ( 12 - 10 ), is 

F(xi) 4" F{x2) + Fixs) 0. (12-14) 

Equation (12-10) sliowa that (12-12) is equivalent to 

xi + X2 + - 0. (12-15) 

Thus, (12-14) bolds whenever (12-15) holds. The corresponding statement 
for two variables, obtained from ( 12 - 11 ), is that 

F{xi) + F(x 2 ) » 0 (12-lG) 

whenever xi + X 2 == 0 , and for one variable, we have 

F{xi) “ 0 when « 0. (12-17) 

From (12-16) we get ‘-F(xz) — F(—X 3 ) by choosing Xi ~ X 3 , X 2 ~ — 0 : 3 , 


and hence (12-14) gives 

F{xi) 4 “ F{x2) == -F(zb) = Fi-xs), 

Since —xs xi + X 2 by (12-15), the function F satisfies 
F(xi) 4 F(x2) = F(xi 4 X 2 ). 

Differentiating partially with respect to Xi and X 2 leads to 

F'{xi) == F'(xi 4* ^^ 2 ) and F'{x 2 ) - F'{xi 4 3 * 2 ). 

Hence F'{xi) = F*{x^. Holding X 2 constant, we see that F'{xi) is constant; 

F{x,) - c 


and hence F(xi) 


cxi, since (12-17) gives F(0) == 0. 
/'(x) 


m 


F{x) =5 cx 


The relation 


yields f{x) = 

where the constant K may be found from 

1 ~ / f(x) dx — K f dx, 

J —>00 • — s )0 


Since the integral diverges if c > 0 , we set c ^ —2h^ to obtain 


1 

K 




by (10-1). Hence X 


hfy/r, and 


/(») 




-*v 


'\/2jr 

\/2h 


(12-18) 
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This result, known as the Gamsian law of error^ states that the variable 
\/2 hx is normally distributed. Specifically, the probability of 

h < V2hx <t2 (12-19) 

ft^ny/2 h) h 

/ ^ 

Jill 


IS 


-A**'* 


dx 


(12-20) 


ilHVsih) -y/^ 

by (12-18), and the change of variable t = \/2 hx shows that (12-20) is 


f’ dt = Hh) - Hh)- (12-21) 

V2ir *'<1 

The most important consideration justifying the use of this analysis in 
practice is that systematic errors must be eliminated. 

The constant h measures the accuracy of the observer and is known as 
the precision constant. That particular error vhich has probability to 
be exceeded in magnitude is called the probable error; it is found to be 
0.4769//1 by use of (12-19), (12-21), and Appendix D. Another interpreta¬ 
tion of the constant h is afforded by considering the mean-absolute error 


, /•* 12 I 1 0.5642 

£(lx|) = / |x|/(x) dx = -^j dx = —— = - (12-22) 

Vir*'0 hy/Tr h 

and still a third interpretation is given by the mean-square error 

2/i /•*> « 1 

x^fix) dx---— dx = —(12-23) 

-00 \/w Jo 2h^ 

The final question mentioned at the beginning of this section concerns 
the effect of increasing the number of measurements 7i. Since a:» = m» — i?, 
we have 

^ fn — V 


where the bar denotes the arithmetic mean: 


X as - Sx,, rn — - Sm,-. 
n n 

Thus, the error in the mean is the mean of the errors. It is likely to be 
smaller than the error in a single measurement because positive and nega¬ 
tive errors tend to cancel when we form For the Gaussian distribution 
(12-18) the situation is especially simple; namely, x has a Gaussian distribu¬ 
tion wiih precision constant h\/ny whenever the independent measurements 
Xi have Gaussian distributions with precision constant h. Thus, if the in¬ 
equality |x| < a has probability p, then the inequality \Jc\< a/y/n has 
the same probability p. This result shows how much more precision is 
attained by increasing the number of measurements. 
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The ptool is omitted becftuse it involves a tedious evaluation of multiple integrals.' 
However, the easential meanmg of the result is that the “scatter” or “spread” for £ is 
1 /\/n times as great as the corresponding spread for x. When interpreted in this fashion 
the property follows from the results established in Sec. 14. 


PROBLEMS 

1 . (a) Show that the sum of the squares of the errors Z(mt — v)^ is least if the true 
value V happens to be the arithmetic mean of the measurements m,-. (b) Deduce that 
the arithmetic moan m is a maximum-likelihood estimate for v when there are n inde¬ 
pendent measurements each satisfying (12-18). Hint: It is required to choose v so that 

.X.) -/(xi)/(xs) .../(x») - 

is maximum. Use the result (a). 

2. In a certain experiment which satisfies the conditions of the text, the probable 
error is 0.01. A measurement mi is about to be made. What is the probability that 
the interval (mi — 0.02, mi -f 0,02) will contain the true value v? Hint: First find A, 
then note that the stated result happens if, and only if, | xi | < 0.02. 

13. Variance, Covariance, and Correlation. Two random variables 
X and y are said to be independent if the event x — and the event y ^ 
are independent events for each choice of r, in the range of x and each y^ 
in the range of y. In other words, knowledge that y has a particular value 
must not influence the probabilities associated with x. The numbers 
shown on two successive tosses of a die are independent in this sense 
(and so were the measurements nu considered in the last section). On 
the other hand, the number of heads in the first three tosses and in the 
first four tosses of a coin are dependent variables. 

The product xy of two random variables is a random variable which 
equals x^yJ when x = x» and y — Although it is not usually true that 
the expectation of a product is the product of the expectations, tlxis is the 
case when the variables are independent. In symbols, 

E{xy) = E{x)E{y)^ x, y independent. (13-1) 

The proof is simple. If p* is the probability that x =« x„ and if gy is the 
probability that y — yjy then the assumed independence gives p^gy for the 
probability that simultaneously x *= x^ and y = y^. Hence 

E(xy) « X:EpiqjXiyj = (Sp.x*) (Sgyy^) = E(x)E{y), 

^ See J. V. Uspensky, “Introduction to Mathematical Probability,” chap. 13, McGraw- 
Hill Book Company, Inc., 1937, for a direct verification. An indirect method based on 
the theory of moments is given in P. G, Hoel, “Introduction to Mathematical Statistics,” 
sec. 6.4, John Wiley & Sons, Inc., New York, 1954. See also M. E. Munroe, ^*The 
Theory of Probability,” pp. 91-96, McGraw-Hill Book Company, Inc., New York, 1951, 
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When a discussion involves several variables x, j/, ..., it is convenient 
to denote expectations by the letter m, with a subscript to indicate the 
variable. Thus, we write 

Eix) ^ Mx, E{y) == fly 

and so on. For example, (13-1) in this notation takes the form 

Mxy = MxMy, X, y independent. (13-2) 

To measure the deviation of a variable from its expected value /x, one 
introduces a quantity a defined by ^ 

<r = VEix - or = E{x - ju)®. (13-3) 

The expression a is called the standard deviation ^ and its square is called 
the variance. As for /x, here, too, it is customary to use a subscript when 
several variables have to be distinguished. For example, 

al = E{X - Mx)^ cr* = E{y - yyf. 

To illustrate the calculation of a variance by means of tlie definition, let x denote the 
number of heads obtained when 3 coins are tossed. Since ft * E{x) we have the 
following table: 


X «* 

0 

1 

i 

2 j 

3 

X — n ^ 

-H 



H 

(I - u. 

% 

H 

H 

y4. 

Probability p, « 

H 

Vs 

Vs 

H 


The definition of expectation now gives 
cr* « E{X - m)* « 4* 

If E{x) = fix and Eiy) == fiy, the quantity 

^xy ^ E{x fix){y My) (13-4) 

is called the covariance of x and y. The covariance is a generalisation of 
the variance, in that the special case y ^ x gives 

nL « E{x -- M.)(x -- Mx) « E{x ^ yx? « crj. 

As an illustration, let us compute when x is the number of heads obtainod on the 
first 2 tosses and y the number obtained altogether in 3 tosses of an unbiased coin. 

* Ihe intent is not — /*)]*. 
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Here 1, ^ so tlui.t we hare tb^ loEowing table: 


Event. 

HHH 

HHT 

HTH 

HTT 

THH 

THT 

TTH 

TTT 

X - Hz . 

1 

1 

0 

0 

0 

0 

-1 

-1 

V - t*v . 



H 

-H 

H 

-H 

-H 

-H 

Product.. 

H 


0 

0 

0 

0 

H 



Since the associated probabilities are we take times the sum of the entries in the 
last row to get 

- H. (13-5) 

We shall now obtain an expression for (r^y which is often more useful 
than (13-4). Expanding the product in (13-4) gives 

aly ~ E{xy — ytix — + MxMy) 

« E(xy) - E(y)yLx - E(x)y,y + 

Upon recalling that E{x) = and E{y) = My we get 

<rly = E{xy) - E{x)E{y) = M*y (13-6) 

which is the required formula. 

To apply this formula to the preceding example, we construct the following table: 


Event . 

HHH 

HHT 

HTH 

HTT 

THH 

THT 

TTH 

TTT 

X . 

2 

2 

1 

1 

1 

1 ; 



y . 

3 

2 

2 

1 

2 

1 

1 

0 

xy . 

6 

4 

2 

1 

2 

1 

0 



Taking 3^ times the sum of the last entries gives E{xy) « 2, and hence by (13-6) 

“ 2 - (DiH) » 

The special case x = ^ in (13-6) gives an alternative form ^ of (13-3), 
namely, 

cr^ E{x^) - M^ - E{x^) - [E{x)]^. ( 13 - 7 ) 

As an illustration the reader may apply this formula to the preceding 

example to obtain 

- H - Of o-y « 3 - mf = (13-8) 

' Note that <r* gives the moment of inertia of the area under the distribution curve 

y » /(x) about the line x «« m which passes through the center of mass. From this 
viewpoint (13-7) is the familiar formula for moment of inertia after a change of rotational 
axes. 


















f 


666 


PBOBABlXnT 


{CHAB, 8 


If the vaiiaWas x and y are independent, (13-2) and (13-6) give ** 0. 
Hence when ^ 0, the variables must be related. A quantitative 
measure of the strength of the relationship is given by the correlaiion 
(mfficimt p: 

p (13-9) 

CTarCTy 

For example, in the foregoing illustration (13-5) and (13-8) yield 


H _ 
VH 




0.816. 


(13-10) 


Thus, if two variables x and y have a correlation coeflScient p = 0.8, then 
they are about as strongly related as are the numbers of heads on the first 
two tosses and on the first three tosses of an unbiased coin. 


The correlation coefficient has the value 1 if j/ « x, and, as we have already observed, 
p « 0 when z and y are unrelated. Moreover, p does not change if x and y are each 
multiplied by a constant factor. Thus, if the correlation coefficient indicates a certain 
strength of relationship for x and y, it will give the same strength of relationship for 2x 
and iy. Similarly, p is unaffected by addition of a constant; for instance, x — 2 and 
y — 3 have the same p as x and y. 

In spite of having these desirable properties, p is not always a reliable measure of 
dependence, and many statistical studies have led to erroneous conclusions through an 
incorrect interpretation of correlation. It is quite possible to have the variables so 
strongly related that y is a function of x and yet p « 0. Before a correlation coefficient 
can be used with confidence, one must know something about the underlying probability 
distribution. 

The variables x and y are said to have a bivariate normal distribution when 
f{x,y) « €(«^’+26xv+c»a+<b-f-cv4/)^ const. 


In this important case the theory of correlation has been fully developed, and it is 
found * that p actually does measure the strength of the relationship between x and y. 

Example: A variable x is said to be “normally distributed with mean m and variance 
<r®“ when its density function is 

1 

f{x) *« t 2\ *r / ^ ^ ^ const. 

V 2ir ff 


Show that the mean is indeed p and the variance <r^. 
By the definition of expectation, 






when we set t • (x — u)/*r. Hence E{x — p) « 0, which gives E{x) « p. 
change of variable leads to 


E(x 




-0 

The same 


^ See Hoel, op, cU., chap. 8. 
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IM) we aee upon integrating by parte and using (MKl). Stnoe ti B{x), the latter v^esult 
E{x — /t)® is the variance by definition. 

Choosing m •* 0, <r »• l/(A\/2)i we obtain (12-lS), and hence the precision constant h 
is given by 

03 - 11 ) 


[cf. (12~23)]. This fact gives a method for estimating h from the data, aa we shaU see 
in the following sections. 


PROBLEMS 

1, Compute <r® if x is uniformly distributed on the interval 0 < x < 1. 

2. Ixit X be the number on top and y the number on the bottom in a toss of a true 
die. Compute Eix), Eiy), E(xy)^ and the covariance. Doea your work indicate that 
the variables are dependent? Find the correlation coefficient. 

8. Three coins are tossed. Let x be the number of henda shown by the first coin, 
whereas y is the number of heads shown by all the coins, ('ompute the correlation 
coefficient. Your result should be smaller than the value (13-10) Why? 

14. Arithmetic Means. In many applications one do(‘s not consider a 
single variable, hut rather one obtains the mean of a large number of 
variables. For instance, if x is a mi^asure oi the length of a rod, one would 
make several measurements xi, and use the arithmetic mean, 

. 0*1 + .r2 H- h Xn ^ ^ ^ 

X-- (14-1) 

n 

in accordance with the procedure of Sec 12. Here the .r,s are not the 
different values of a single variable but are n random variables describing 
the result of n independent measurements. 

Just as one uses (Tx to indicate the standard deviation of the variable x, 
it is customary to let Cf denote the standard deviation of x. The following 
theorem enables us to compute from <rx in many cases; 

Theorem. // the variables Xj are independent^ if they have the same ex-- 
pectation J5(x») 3 = p and the same variance (r% then 

- - 

For proof, observe that 

E{xi d-h Xn) = ^(^1) d— • d- E{xn) == np. 

The variance of Xi d-h is therefore 

Eixi d- ]-Xn - np)^, 

which may be written 

E[{xi — m) d- (^2 — m) H- \’ (xn p)f. 





8 


ExpandiBg the bracket we obtain 

£r [2 ^ - m) j* (14-3) 

Since the variables are independent, the covariance of x, and is zero 
for i 9 ^ j; that is, 

E(xi - n){xj - m ) « 0 . 

Also the definition of a, gives 

al « E{Xi — m)^* 

Hence, taking the expectation in (14-3) yields 

E{xi H- nn)^ ■= ncl. 

Dividing by we have 

[Xi-\ -h Xn 


E 


~ Xi -h Xn ^ ^ ^ 

n in 


which gives (14-2) upon taking the square root. 

The intuitive meaning of this result is approximately as follows: Suppose 
a single measurement varies over an interval of length / about the true 
value, so that I measures the scatter or spread. Then the mean of n in¬ 
dependent measurements will have a spread of the order of 1/ y/n about 
the true value. 

To illustrate the use of (14-2) let = 1 if there is success at tlie ^th 
trial in a set of independent trials with probability p, and let x,- == 0 other¬ 
wise. For each variable x,- we have xf = Xi and hence 

F(x?) = J5?(x0 -p-l + g-O-p. 

By (13-7) the corresponding variance is 


p* = p(l - p) = pg 


and (14-2) now gives 


art 


n 


For the variables Xi considered in the foregoing paragraph the mean t is simply the 
relative frequency tn/n, where m is the number of successes. We have, then, 




(14-4) 


which shows again that the relative frequency m/n is likely to be close to p when n is 
large. The corresponding result for a generwl variable x is based on (14-2); it leads to 
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assertions ocmcming \E(x) t\ wbidi are Mmiiar to the theorem estaWihed hi See. 
10 but of greater scope. 

Multiplying (14-4) through by n we get 

[j^(m — np)*J^ * Vnpff. 

This ^ves an interpretation for the quantity y/npq that arose in connection with the 
normal law (Sec, 10); namely, \/npq is the eUmdard deviation of the number of sucoeseee m. 

16. Esthnatioii of the Variance. If xi, ^ 2 , ..Xn are n independent 
observations of a variable x, the sample varicmce is defined by 

« 1 X{xi ~ . (15-1) 

n 

Unlike the theoretical variance the sample variance is computed from 
the observations, hence is actually available. It will be seen, now, that 
^ can be used to estimate 
W.e have 

E(ns^) - :^E(xi - 

- XE[(Xi ~ (i5 ^ ^)]2 

= 2[£(x< - - 2jB(x,- - ^)(x - m) + E(Si - ^^)\ (15-2) 

Now, E{xi — m)* = by definition, and E{x — (if — a^/n by (14-2). 
For the middle term in (15-2) we get 

1 

'^(Xi - m)(^ - m) = - E(X^ ~ H- Xi-\ -h Xn ~ Mn) 

n 

= - E{xi - ^)(xi-^ + E(xi - m)=* = - a* 

n n n 

when we note that the terms not written explicitly are independent of x*. 
That is, for i j, Eq. (13-1) gives 

£[(x,- — = E{xi — u)E{xj) = O'ju = 0. 

Substituting into (15-2) yields the important formula 

E{n^) = (n - l)a^. (15-3) 

If (15-3) is divided by n, we get 

0-2 (16-4) 

n 

upon recalling (15-1). On the other hand the definition of nfives 


^((l. -m)’] “ -r*. 


(15-5) 



670 


pftOBABiury 


[chap. 8 

It k not surpriBing th&t (IM) gives a smaller value than (15*5), inasmuch as the chaice 
f is the value of ft that minimizes (15-5) (cf. Prob. 1, Sec. 12), The fact that (16-4) 
should be smaller than (15-5) is especially clear when there is only one measurement, zi. 
In this <case (16-4) gives zero because xi « JR. 

The foregoing remarks indicate that is not a suitable estimate of 
it has a tendency to be too small But if we divide (15-3) by n — 1 for 
n > 2, we get 

which gives the following theorem: 

Theorem. Lei Xi^X 2 j .^Xn hen htdependeni observations of a variable jr, 
mth n > 2. If is the sample variance^ then the quantity 

(15-6) 

n — 1 

is an unbiased estimate of cr^. That is, E(&^) « 

To illustrate the use of the theorem, let 

mi =» 12, m2 “* 8, nti =• 13 

be three measurements of an unknown quantity whose true value is v. The errors in 
the measurement are x, *« m, -• r, but since 

Xt — 5 « m, — » — 4- r « m» ~ m (15-7) 

we can compute mihotU knouring v. By (15-1) and (15-7), 

» ]S(x, - x)* a* 2(mi — m)®. 

In this example SI -« 11, so that 

n«2 « {If -h (-3)^ + (2)2 * 14. 

Hence an estimate for is 



According to (13-11) the precision constant h is estimated ns h ^ \l{-yj2 d) »* l/\/i4 
«• 0.27. In statistics it is shown how one can determine the reliability of an estimate 
such as this, though we do not pursue the subject here.^ 

PROBLEMS 

1. A certain experiment gave the measurements 

m, *» 17, 21, 20, 18, 14. 

Obtain an unbiased estimate for the variance of a single measurement, and from this, 
estimate the precision constant. 

* See Hoel, op. al, chap. 10. 
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2. If tl:i« precision constant in Prob. 1 can be assumed exactly equal to your estimate 
of it, (a) what is the probability that the next measurement will be within 0.5 of the 
true value? (5) How many measurements must you make if you want a probability 
0.96 that the mean of those measurements will be within 0.1 of the true value? Hint: 
Use the fact that the precision constant of the mean is A\/n if that of a single measure¬ 
ment is h. 

8. In a certain measuring routine the cost of equipment and materials is negligible 
but the time required is proportional to the number of measurements. Give a rational 
method of adjusting the salaries of two observers whose w^orking speeds are «i and if 
the precision constants of their measurements are hi and h. Hint: Consider the number 
of measurements each must make to attain equal reliability in the respective arithmetic 
means. 

4* Discuss Prob. 3 if the cost of equipment is proportional to the length of time it is 
used and the cost of material is proportional to the number of measurements. 
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The principal concern of numerical analysis is with the construction of 
effective methods for the calculation of unknowns entering in the formula¬ 
tion of a given problem. Since every formulation of a practical problem 
involves assumptions and approximations, it is senseless to seek unknowns 
to a high(jr precision than is warranted by the initial data. A simple and 
perhaps crude technique giving the desired values within specified limits 
of tolerance is alwaj'^s to be preferred to an involved method capable of 
yielding an arbitrary degree of accuracy. 

In recent years the growth of numerical analysis was accelerated by the 
demands of science and technology for numerical solutions of many pressing 
problems. High-speed computing machines produced for coping with 
such problems are certain to open new vistas in science and leave a pro¬ 
found imprint in all fu^lds of human activity. 

It is the object of this ciiapter to present the rudiments of numerical 
analysis essential to all (ioncerned with the processing of numerical data. 
Inasmuch as the understanding of principles must precede the acquisition 
of computing skills, the emphasis in the following sections is placed on 
basic ideas and general methods rather than on spe(‘ial techniques useful 
in solving this or that problem. Among topics included here are the 
determination of real roots of algebraic and transcendental equations, 
the basic method for solving systems of linear equations, the elements of 
interpolation theory, and its bearing on curve fitting and numerical so¬ 
lution of differential equations. 

SOLUTION OF EQUATIONS 

1. Graphical Methods. Geometric considerations usually are a useful 
guide in the construction of analytic methods of solution of practical 
problems. This is particularly true in tlie problem of determination of 
numerical values of the roots of algebraic and transcendental equations.^ 

^ A polynomial equation ar” -j- H-}- » 0 is called an algebraic equation. 

An equation F(x) 0 which m not reducible to an algebraic equation is called 
dentaL Thus, tan x — x « 0 is a transcendental equation, and so is e® -f 2 cos x 0. 
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If F{x) is a real continuous function, the equation 

Fix) - 0 (1-1) 

may have real roots. The approximate values of such roots can be de¬ 
termined by graphing the function y = F{x) and reading from the grapli 
the values of x for which y ~ 0. This familiar procedure for graphical 
determination of real roots can frequently be simplified by rewriting (1-1) 
in the form 

m - gix), ( 1 - 2 ) 

The abscissas of points of intersection of the curves y » fix) and y = gix) 
will obviously be the roots of (1-2). 

Thus, an approximate value of the real root of 

Fix) s ~ 146.25X - 682.5 « 0 

can be found by graphing the function 

y ^ - 146.25Z ~ 682.5. 

It is simpler, however, to plot the cubic 

y « ap* 

and the straight line (Fig. 1) 

y = 146.25a: + 682.5 

and read off from the graph the abscissa of their point of intersection Pq, 

An obvious disadvantage of graphical 
methods is that they require plotting curves 
on a large scale when a high degree of accu¬ 
racy is desired. To avoid this, one obtains 
more precise values by applying one of the 
several methods of successive approxima¬ 
tions discussed in Secs. 2 and 3, All these 
methods require that the desired root be 
first isolated. That is, they call for the 
determination of an interval which contains 
just the root in question and no others. 
If Fix) is a continuous function, and if for 
a certain pair of real values x ^ xi^x ^ x^^ 
? the signs of Fixi) and ^(^ 2 ) are opposite, 
then it is obvious that Fix) « 0 has at least 
one real root in the interval ixiyX^). If there 
are several roots in ixi^t^y one usually nar¬ 
rows down this interval by a succession of judicious trials until an interval 
is obtained whfch contains just the desired root. For efficient applica- 
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tion of the successive-approximations methods it is desirable that this 
interval be as small as possible. 

We note in passing that no general methods are available for the exact 
determination of the roots of transcendental equations. Also, there are 
no algebraic formulas for the solution of general algebraic equations of 
degree higher than 4. The so-called Cardan and Ferrari solutions of the 
cubic and quartic equations require the calculation of cube roots of quan¬ 
tities which themselves are square roots. Generally it is simpler to obtain 
the desired approximations by methods described in the following sections 
than to make use of Cardan's formulas.* 

PROBLEMS 

1 . Find graphically, correct to one decimal, the real roots of: 

(a) 2^ - a:® « 0; (b) - x - I ^ 0; (r) x* - x - 0.6 « 0; (d) e* + a; - 0; 

(e) tan j—x=0, ir<J< 3ir/2. 

Isolate the roots (that is, for each root find an interval which contains just that root 
and no others), 

2. A sphere 2 ft in diameter is made of wood whose specific gravity is Find to 
one-decimal accuracy the depth h to which the sphere sinks m water. Hint: The volume 
of a spherical segment is vh\r — h/Z). The volume of the submerged segment is equal 
to the volume of displaced water, which must weigh as much as the sphere. If water 
weighs 62 5 lb per ft®, 

(r-?) 62.562.6, 

and since r •* 1, we have h} — Zh^ *+- ?^ ** 0. 

2. Simple Iterative Methods. WTien real roots of Eq. (1-1) have been 
isolated, there are many methods for computing them to any degree of 
accuracy. These all depend on the application of some iterative formula 
which furnishes values of the succeeding approximations from the preced¬ 
ing ones. The nature of restrictions imposed on the function F{x) in the 
equation 

Fix) - 0 (2-1) 

in the two basic iterative methods discussed here is obvious from the 
description of the methods. The simplest of these is the method of linear 
interpolation, also known as the method of false position. 

Let the root xo of (2-1) be isolated between xi and Then, in the 

^ A numerical determination of the roots of algebraic equations is frequently accom¬ 
plished by some method of synthetic division (such as Homer’s method) or by the root¬ 
squaring method (Graeffe’s method). These special methods are discussed in many 
books. See, for example, F. B. Hildebrand, ‘‘Introduction to Numerical Analysis/* 
McGraw-Hill Book Company, Inc., New York, 1956. The methods of Secs. 2 and 3 
of this chapter apply to all types of equations and are generally adequate for the deter¬ 
mination of real roots. 
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interv-al the graph of y » F{x) may have the appearance shown 

in Fig, 2. If the points Pi and in Fig. 2 are joined by a straight line, 

it will cut the x axis at some point 
^ 3 , which usually is closer to the 
root Xo than either Xi or X 2 * But 
from similar triangles, 



- xi 

-Fix,) 


X2 - Xs 


F{X2) 

and on solving for xs w'e get 
XlFiX2) - X2Fixi) 

X3 


(2-2) 


(2-3) 


Fix2) ~ F(.ri) 

To obtain a (‘loser approximation 
to .To, v\e can determine the x inter¬ 
cept of the straight line joining the 
point P 3 in Fig 2 with the point P 2 
and thus obtain the next approximation T 4 . By repeating this process ve 
obtain a sequence of values 


^ 3 > XI, f . ♦ *} Xfi) 

which generally converges to To. The process described here is precisely 
that used in interpolating tabulated valu(‘s of logarithms and other func¬ 
tions. In effect, it replaces a small portion of the cuiw(‘ bj^ a straight lim^. 
Another useful iterative method is baaed on rewriting (2-1) in the form 


Now, if the real roots of 


fix) - g{x). 
fix) == c 


(2-4) 


can be determined for e\Try real c, we can proceed as follows. Let Ti be 
an approximate value of the root To of (2-1). This, of course, is also an 
approximate root of (2-4), since (2-1) and (2-4) are equivalent equations 
On setting x ^ x, in the right-hand member of (2-4) we get the equation 

fix) = gixi), (2-5) 

which by hypothesis we can solve. If the solution of (2-5) is T 2 , we obtain, 
on setting t = T 2 in the right-hand member of (2-4), 

fix) « gix 2 ). (2-6) 

The solution T 3 of ( 2 - 6 ) we call the third approximation, and in general, 
the nth approximation Xn is determined by solving 

fix) « gixn^,). 


(2-7) 
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From the geometric interpretations of this procedure, which we give 
next, it will be seen that the sequence rri, ^ 2 , , Xn, ... converges to the 

root Xo of ( 2 - 1 ) if, in the interval of length 2 |xi — xoj centered at xo, we 
have 

l/'(x)|> lg'(x)l (2-8) 

and the derivatives arc bounded. 

Suppose, first, that the slopes of tiie curves 

y = f(x)y y « g{x) (2-9) 

in the interval Cro,xi) (Fig. 3) have the same sign and satisfy (2-8). When 
X = xi is taken as the first approximation to Xo, Eq. (2-5) yields the second 



approximation J 2 ; which (‘orresponds to the abscissa of the point of inter¬ 
section P 2 of the straight line y = g{xi) with y = /(x). Equation (2-6) 
gives J 3 , which is the abscissa of the point of intersection F 3 of the straight 
line y = fif(x 2 ) witli y == /(x), and so on. The sequence Xi, X 2 , X 3 , ... 
obviously converges to xq. 

The situation when the slopes of the curves (2-9) satisfy (2-8) but are 
opposite in sign is illustrated in Fig. 4. The value X 2 determined by 
solving ( 2 - 5 ) is the abscissa of the point of intersection P 2 of y « /(x) 
with y ^ g(xi). It lies on the opposite side of the root from Xi. The 
third approximation X 3 is the abscissa of the intersection of y =» gix 2 ) 
with y = /(x), and it lies on the same side as xi but nearer to Xq. In Fig. 3 
the approach to the intersection Pq is along a staircase path, while in 
Fig. 4 it is along a spiral. In either case, the rapidity of convergence ‘ 
depends on the nature of the functions/(x) and g{x). 

^ Some criteria for the speed of convergence are given in Hildebratid, op, cU, 
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Example 1. Determine the approximate values of the real roots of 

e* — 4x « 0. (2-10) 

The real roots of this equation are the abscissas of the points of intersection of the 

curves y ^ and jy « 4x shown in Fig. 5. 
It appears that the smaller of the roots, xo 
lies in the vicinity of x « 0.3. The larger 
root, ^ 0 , is close to x « 2 1. Since for x •* xo 
the slope of i/ * 4x is greater than that of 
y » e*, we write (2-10) in the form 

X » 

so that in the notation of Eq. (2-4) 

/(x) « X and g{x) « 

The sequence of approximations Xn according 
to (2-7) is thus determined from 

n»l,2, .... (2-11) 

If we take xi ® 0.3, we get * from (2-11) 

X2 - JieO.8 „ 14(1.34986) - 0.3374 

X, « «. ^(1.40130) - 0.3503 

X4 « « ^(1.41949) - 0.3549 

xj « « J4(1.42603) - 0.3565 

xe « - }i(l.42832) - 0.3571 

X7 - - Ji(1.42917) « 0.3578. 

‘ In performing these calculations it is convenient to use tables such as 'Tables of 
Esqiofientta] Functions/^ National Bureau of Standards, Washington, D.C., 1951. 
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If only three-dedmal-place accuracy is required, the computations can be terminated at 
this stage. 

To obtain the second root we note that at « (o« the slope of y 4x is less than that 
of j/ *• e*. If we write (2-10) in the form 

c* «* 4x 


or X ^ log 4at, 

so that/(a;) » x and g{x) *■ log 4x, then the condition (2-8) is satisfied at x ■« €o. 

The desired sequence jxn} is now given by 

Xn+i * log 4xn, n • 1, 2, ..., 

and we can take xi *» 2,1. 

Using tables of natural logarithms * we find 

2*2 ** log 4xi *• log 8.4 « 2.12823 

xs •* log 4x2 ** log 8.5129 « 2.14158 

tm log 4x8 log 8.5663 *» 2.14783 

Xj ™ log 4x4 log 8.5913 2.15075 

X6 - log 4x6 « log 8.6030 - 2.15211 

X 7 » log 4x6 - log 8.6084 - 2.15273 

X8 « log 4x7 - log 8.6109 *= 2.15303 

x» log 4x8 « log 8.6121 « 2.15316. 

The value of the root $o, correct to three decimals, is 2.153. We do not give a dis¬ 
cussion of the errors in the approximations obtained by such calculations because a 
rigorous analysis of errors in the iterative procedures is fairly involved.* 

Example 2. Find an approximate value of the real root of 


near x «* 3 t/2. 

From the graphs of 


X — tan X -• 0 


y X and y ■■ tan x 


( 2 - 12 ) 


in Fig. 6, it appears that £q. (2-12) has just one real root in each of the intervals 

(2n — l)ir/2 < i < (2n -f- 1 )t/ 2, where n « 0, ±1, ±2, - 

It is convenient to rewrite (2-12) in the form 

X tan~* X, 

so that in the notation (2-4) /(x) «• x and g(x) tan~' x. This choice assures that the 
condition (2-8) is satisfied at the root xq. 

The sequence of appro3dmstions this time is given by 

Xn+i tan~^ Xn, » «• 1, 2, .. 

*For example, 'Tables of Natural Logarithms,’' National Bureau of Standards, 
Washington, D.C., 1941. 

* A brief discussion is contained in Hildebrand, op. cif., chap. 10. 
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On taking •» 8ir/2 «* 4.7124 radians, we 6nd 

xz - tan~' 4.7124 « 4.5033 
xz tan~* 4.5033 « 4.4938 
Xi - tan~i 4.4938 «* 4.4935, 

which suggest that the root xo, correct to three decimals, is 4.493. 



These examples indicate that if it is possible to write Eq. (2-1) in the form 

X « gix), 

and if | g^(x) | < Af < 1 in the interval of length 21 xi — xo | centered at xo, then the 
recursion formula giving the desired approximating sequence is 

Xn+i « 9(xn), n « 1, 2, .... (2-13) 

PROBLEMS 

1. Use both methods of this section to obtain, correct to two decimals, the values of 
the real roots in Probs. 1 and 2 of Sec, 1. 

2. Find in the manner of the examples of this section the real roots of — x — 0.2 » 0 
correct to three decimals. 

3. Newton^s Method. The successive terms in the approximating se¬ 
quence in the method of false position (see Fig. 2) are determined by the 
intersection of tlie secant line with the x axis. Newton proposed con¬ 
structing an approximating sequence determined by the intersection with 
the z axis of the tangent line to the curve y - F(x). 

Thus let the root x = xq of 


F(x) « 0 


(8-1) 
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lie in tJie vicinity of x « xi (see Fig. 7). The equation of the tangent 
line toy ^ F(x) at Pi(xi,yi) is 


y - F{xi) « F'(xi)(x - xi). (3-2) 


If the curve y « F{x) has the appearance shown in Fig. 7, the tangent 
line (3-2) cuts the x axis at X 2 , which is a better approximation to the root 
than xi. To determine we set 


y Q and find 


X2 = Xi ~ 


F\x^) 


if F'(.Ti) 5 *^ 0. Having determined 
X 2 y we find in the same way that the 
tangent to y — F(x) at P 2 [^ 2 tF(^ 2 )\ 
intersects the axis at 


^3 = X2 


P(T2) 

F\X2) 


and in general, 





F(Xn) 

F\xS 


1 , 2 , 


(3-3) 


Tlie geometric considerations indicate that when y = F{x) is a mono¬ 
tone incrc'asing or decreasing function in the interval (xi,.T 2 ) [so that 
V'[x) does not change sign] and when there is no point of inflection in 
this inte‘r\al [so tliat F"{x) does not change sign], the sequence (3-3) 
conv’erges to the root Xq- 



The situations corresponding to the cases when there is a point of in¬ 
flection or a horizontal tangent to y = F{x) in the vicinity of the root are 
illustrated in Figs. 8 and 9. It is clear from these figures that in these 
cases the sequence (3-3) need not converge to xo- Thus, before applying 
Newton’s method one should examine the behavior of F'(x) and F'\x) 
in the vicinity of the root. 
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Bmwpk: Find the angle subtended at the center of a circle by an arc whose length is 
double the length of its chord. 

Let the arc BCA (Fig. 10) be of length 2BA. If the angle subtended by this arc at 
the center of the circle is 2® radians, then the arc BCA •» 2xr while BA 2r sin x, 
r being the radius of the circle. 

Our problem requires that 

2xr 4r sin x, 

or X — 2 sin X « 0. (3-4) 

On graphing the functions j/ =■ x and y « 2 sin x (Fig. U), we see that they intersect at 
X ■» 0 and at x » 1.88 radians, approximately. Wc reject the trivial solution x »» 0. 




Since y * x — 2 sin x is obviously monotone increasing and has no point of inflection 
near the root xo, we can apply formula (3-3) with n « 1.88. Wc find 

xi — 2 sin xi 

X2 xi - - —-- 

1 — 2 cos xi 


The third approximation is 


1.88 -- 2 sin 1.88 
1 - 2cor1.88~ 


X| 


X 2 — 2 sin X 2 
1—2 cos X 2 


1.896. 


* 1.896 - 


1.896 - 2 sin 1 896 
1 - 2 cos LSoST" 


1.8955, 


which is nearly the same as xa. The angle subtended hy the arc BCA^ as given by this 
approximation, is 3.7910 radians. 


PROBLEMS 

1. Calculate by Newton’s method the roots in Examples 1 and 2 of Sec. 2. 

9* Solve by Newton’s method Prob. 2, Sec. 1. 

Find to three decimals by Newton’s method the angle subtended at the center of 
a circle by a chord which cute off a segment whose area is one-fourth that of the circle. 

4. Find by Newton’s method to three decimal places the real roots of the following 
equations: (a) x - coex - 0; (6) x -f c* - 0; (c) x^ - x - 1 « 0; (d) x® - 25 - 0; 
W x* — X - 0,2 •• 0. 
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4. Systems of Linear Eqttations. The Gauss Reduction. No doubt ^he 
reader is familiar with Cramer^s rule for solving systems of n linear equations 
in n unknowns by determinants.* Although Cramer’s rule is important 
in numerous theoretical considerations, it is of questionable practical value 
when the given system contains more than two unknowns. Usually it is 
easier to obtain solutions by some process of elimination of unknowns. 
The simplest practical method for solving systems of linear equations, 
based on the idea of elimination, is the Gauss reduction method. Its 
several variants form the basis for most techniques used in the solutions 
of large systems of equations.* 

The idea of the method is simple. Lei it be required to solve a system 
of n linear equations 

+ • * * + Clin^n = Ci 

021X1 + 022^2 d-h a2nX« == C 2 (4-1) 


^n\^l "h ^n2^2 “4“ * * * *4“ ^nn^n 


in 71 unknowns Xi. We divide the first equation in (4-1) by On, solve for 
xi, and use the result to eliminate xi in the other equations. The resulting 
system of n — 1 equations in X 2 , ..., Xn is treated in the same way. 
That is, we divide the first of these equations by the coefficient of and 
use the result to eliminate X 2 from the remaining equations. After con¬ 
tinuing the process n times ® we obtain an equivalent system 


3^1 4“ ^ 12 X 2 4“ ni3^3 alnXn = Cl 

X 2 4* ^23^3 4" * * * 4" U2nXn = C 2 


Xn—1 4“ Un- 


nXn — Cn —1 
Xn ~ Cn 


(4-2) 


provided the given system has a unique solution. The substitution Xn ~ Cn 
in the preceding equation in the set (4-2) yields the value of x„__i, and 
by working backward we obtain in succession the values of Xn-. 2 , ^n~ 3 , ..., 

Xj, 

In practice the Gauss reduction can be performed in the manner indi¬ 
cated in the following example. 


• A summary of the properties of determinant and Cramer's rule are given in Ap¬ 
pendix A. 

• Among such variants are the Grout and the Gauss-Jordan reductions. These are 
described in Hildebrand, op, cit,, and in many other books. 

• If the coefficient of Xr in the rth equation vanishes, it is necessary to renumber the 
variables or equations. 
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EseampU: Bahfe the system 

2.843x1 - 1.326x8 -f 9.341X8 - 5.643 
8.673x1 + 1.295x3 - 3.215x8 « 3.124 (4^) 

0.173x1 - 7.724x3 -f 2.832x8 » 1.694 
by the method of Gauss' reduction. 

On dividing each equation in (4-3) by the coefficients of xi in that equation, we get 
XI - 0.46641X3 -f 3.4615x8 « 1.9849 

XI + 0.14931x2 - 0,37069x8 « 0.36020 (4-4) 

xi - 44.647x2 + 16.370x8 - 9.7919. 

The subtraction of the second equation in (4-4) from the first and the third gives 
-0.61572x2 4- 3.8322x8 * 1.6247 
-44.796x2 -f 16.741x8 * 9.4317 
and, on dividing these by the coefficients of X 2 , we find 
X 2 - 6.2239x3 « -2.6387 

(4-6) 

X 2 - 0.37372x8 « -0.21055. 

Subtracting the second equation from the first in (4-6) yields 

-5.8502x8 * -2.4282, 

(4-6) 

so that xa » 0.41506. 

The reduced system consists of the first equations in (4-4) and (4-5) and Eq. (4-6). It is 
xi - 0.46641x2 + 3,4616x8 « 1.9849 

X 2 - 6,2239x8 » -2.6387 (4-7) 

X8 « 0.41506. 

The substitution of the value of xa from the last into the second equation of (4-7) gives 
X2 « -2.6387 -{- 6.2239(0 41606) * -0.055408 
and the first reduced equation finally yields 

xi « 1.9849 + 0.46641 (-0.055408) - 3.4615(0,41506) « 0.62232. 

There are numerous modifications of the procedure just indicated, some of which are 
adapted for computations on desk calculators while others are more suitable for high¬ 
speed electronic computers. 

PROBLEM 

Use Cramer’s rule and also apply the Gauss reduction to solve the following systems: 

(a) 2x + j/ + 32 «* 2, 

3x — 2y — 2 » 1, 

X — y + 2 «* —1; 

^ (6) 2Xi + X2 + 3x8 + X4 «« —2, 

6 xi + 3x2 — xj — X 4 1 , 

Xj, — 2x2 ■+• 4x8 Hh 3x4 ■* 4, 

3xi *^X2 +X8 « 2; 
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(c) l.»29afi + lAlBxi - tMlsz ^ 0.532, 

L39&P8 - 0.681a?i - 1.211, 
l.OOloJi 4- 2.093X8 - 0.556, 

5. An Iterative Method for Systems of Linear Equations* Except for 
the round-off errors the Gauss reduction method explained in the pre¬ 
ceding section is exact. When the determinant of the system (4-1) is 
different from aiero, it yields the desired solution after a finite number of 
steps. However, successive steps leading to an equivalent triangular 
system (4-2) may prove laborious and ill-adapted to machine calculations. 
For this reason, a variety of iterative methods, which in theory require 
an infinite number of steps to obtain an exact solution, have been devised. 

One of these methods, due' to L. Seidel, is based on the use of the 
iterative formula (2-13). The convergence of any iterative method ob¬ 
viously depends on the character of the system under consideration. 

In many cases the system (4-1) can be rewritten so that in the tth 
equation the coefficient an of the unknown Xiis numerically large compared 
with other coefficients. That is to say, the coefficients along the diagonal 
of the system (4-1) dominate the other coefficients. In this event by solving 
the ith equation for x* we can rewrite such a system (4-1) in the form 

1 

Xi ^ - (Ci — ai2X2 ~ ^13X3- ainXn)f 

ail 

1 

X 2 (^2 <^ 21 X 1 023 X 3 ... — a2nXn)) (fi~l) 

022 


Xfi = {Cf^ a>nlXi 0^23^2 ' ’ * On.n—l^n—l). 

Onn 

If we set Xi =» X 2 = • • • == a:n = 0 in the right-hand members of (5-1), we 
obtain 

. i=l,2,...,n, ( 5 - 2 ) 

a»- 

which is called the first approximation to the solution of (5-1). 

Tlie substitution of this first approximation in the right-hand members 
of (5-1) yields the second approximation and so on. The cycle is 
then repeated with the expectation that the values xj^^ after the A;th iteration 
are not substantially altered by further iterations.* 

^ Generally called the Gauss-Seidel method. 

•There are several criteria for convergence of this process which generally are not 
easy to verify. It is known that when the coefficients in (4-1) are symmetric (so that 
Oi, ajt) and the matrix (an) is positive definite, the Seidel process always converges. 
See Hildebrand, op* for a brief discussion of several criteria. 
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In practice the iteration process described above is usually modified by 
taking as the first approximation the value of Xi obtained from the 
first equation in (5-'i) by setting X 2 « xs «=•••» Xn « 0. Using this 
value in the second equation in place of xi and setting X 3 X 4 =» • • • » Xn 
*» 0, one obtains the approximation To obtain one inserts for 
Xi and X 2 the values and x^^^ in the third equation and sets X 4 « xs ~ 
... Xn » 0. Finally, to get the value of x^^^ one uses previously found 
values x^\ ..., x ^-1 in the last equation of the system (5-1). This process 
is repeated to obtain approximations of higher orders. 

This particular choice of approximations usually improves the rapidity 
of convergence of the process. We illustrate it by an example. 

Example: The system (4-3) can be rewritten in the form 

8.673X1 -f- 1.295x2 - 3.215x8 - 3.124 

0.173x1 - 7.724x2 + 2.832x8 * 1.694 (5-3) 

2.843x1 - 1.326X2 -h 9.841x8 « 5.643 

in which the diagonal coefficients dominate. 

We next write (5-3) in the form (5-1) and get 

- 1.296XJ + 3.215*1) 

o.o7o 

U - -^21 “ 2.8321,) (54) 

I, - (6.843 - 2.843*1 + 1.326*,). 

9.841 

To obtain x^^^ we set X 2 x* « 0 in the first equation in (5-4) and find 

*5” - = 0.36020. 

8.673 


Inserting this value for xi and setting x* « 0 in the second equation in (5-4), we get 

xi*^ - -0.21125. 

Finally, ■» 0.44089 is obtained by using the values x}^^ and in place of xi 
and X 2 in the third of Eqs. (5-4). 

A repetition of the process yields second, third, and higher approximations. These 
are recorded in the table: 


k 

1 


3 

4 

1 

6 

7 

*i*) 

0.36020 

0.56517 

0.51780 

0.52312 

0.52220 

0.52235 

0.62233 


-0.21125 

-0.04523 

-0.05852 

-0.05501 

-0.05550 

-0.05543 

-0.05544 

*i« 

0.44089 

0.40694 

0.41594 

i 0.41488 

1 

1 

0.41508 

0.4150S 

0.41605 
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A oompariaoii with the values found iu Sec, 4 by the Gauss reduotlou method dbiows 
that in problem six iterations were necessaiy to get four^eoimal accuracy. 


INTERPOLATION, EMPIRICAL FORMULAS. LEAST SQUARES 

6. Differences. One of the problems connected with the analysis of 
experimental data concerns the representation of such data by analytic 
formulas. Thus, we may wish to represent, either exactly or approxi¬ 
mately, a set of observed values {xi^yi) by some relationship of the form 
y ^ fix). In such analysis the concept of differences is important. 

We consider a set of pairs of values {xi^y^^ where i » 0, 1, ..., n, which 
can be represented by points in the xy plane. The differences between 
successive pairs of ordinates and yi we call the first forward differences 
of the 2/8 and we denote them by Ay». Thus, 

= 2/t+i - Vif i « 0, 1, 2, ,.n. (6-1) 

The second forward differences are defined by 

A^yi « Ayi^i - Ay»- 

and, in general, the kth forward differerms are 

A^yt « ( 6 - 2 ) 

These differences are usually represented in a tabular form: 


Tabli 1 
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in which the quantities in each cdiumn represent the differences between 
the quantities in the preceding column. These are usually placed midway 
between the quantities being subtracted, so that the forward differences 
with like subscripts lie along the diagonals indicated in the table by arrows. 

We note that if the rth differences A^yt are constant, then all differences 
of order higher than r are zero.^ 

Now, it follows from (6-1) and (6-2) that 

Vi ^ Vo + Ayo 

yi + Ayi « (yo + Ayo) + (A^yo + Ayo) ^ yo + 2Ayo + A^yo 

^8 Vs + ^2/2 ~ ( 2^0 + 2Ayo + ^^Vo) + (A^yi + AyO 

(^0 4“ 2Ayo + A^yo) 4- (A^yo 4- A^yo 4* A‘^yo 4* ^yo) 

«= 2/0 4” 3 Ayo 4“ 3A^yo 4" A^yo* 

These results can be written symbolically as 

yi = (1 4“ A)yo, y 2 ~ (i 4“ A)^yo, ya *== (1 4- A)^yo 

in which (1 + A)^ is an operator on yo with the exponent on the A indicat¬ 
ing the order of the difference. The difference operator A is analogous to 
the differential operator D introduced in Chap. 1. 

We easily establish by induction that 

yfc-(l + A)V fc = l,2, (6-3) 

or, in the expanded fonn, 

k{k - 1) , k{k - l)(fc - 2) , 

Vk ^ Vo+ k Ayo -\ - - - A^vo H- - -AV d-• (6-4) 

Formula (6-4) enables us to represent every value yk in terms of yo and 
the forward differences Ayo, A^yo, .... 

We can derive a similar formula by starting with the values of the ys 
at the end of Table 1 and forming the backward differences defined as 
follows: The first backward differences Vy^ are 

Vy, « y, - y»^i. (6-5) 

The second backward differences V^y* are defined by 

VV = Vi/, - Vj/,_1, (6-6) 

and in general, the kth backward differences V*y, are 

V*J/. = V*^V. - V*-Vf_i. (6-7) 

^ A differeaces table in a specific numerical example appears in the Example of the 
next section. 
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A table of backward differences is indicated in Table 2, where the dif¬ 
ferences V^yi with a fixed subscript i lie along the diagonals slanting up, 
as shown by arrows. 

TABlJi 2 



Now, from (6*5) to (6-7) we deduce that 

^^J/n ~ ^Vn ^2/n—1 ~ Vn 2]^n--1 “H ^n—2 

== VVi - ^ Vn — ^Vn-l + 32/n~.2 - Vn-^ 

and in general ^ 

v*jf„ = - V''~Vn-l = 2 ) 2 /n-r, 

raatO \r/ 

/c(fc - l)(/c ~ 2) ... (fc - r + 1) 


where 


0 


(6^) 

(&- 9 ) 


is the binomial coefficient of in the expansion of (1 + x)^. 

By using (6-8) successively in the definitions of backward differences 
we find 

yn^l « 2 /n - ^Vn ^ (1 - 

yn—2 ~ 2/n ^ (1 — V)^yn) 


and, in general, - (1 - V)V„, (6-10) 

where V is the backward-difference operator. The formula (6-10) when 
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expanded reads 

Vn-k - y« - kVyn + 


Kk - 1 ) 
2! 




k(k - l)(ifc - 2) 
3! 


VVn+---. 

( 6 - 11 ) 


It shows that any value of y in Table 2 can be expressed in terms of pn 
and backward differences V*j/n* 

We shall use formulas (6-4) and (6-11) to derive certain interpolation 
formulas and to deduce some formulas for numerical integration. 


PROBLEMS 


1. Compute the forward and backward differences for the following set of data; 


X 

1 

2 

3 

4 

5 

6 

7 

8 

y 

2.106 

2.808 

3.614 

4.604 

5.857 

7.451 

9.467 

11.985 


2. Write expressions for the y*, A: »» 1, 2, ..in Prob. 1 by using (6-4) and (6-11). 


7. Polynomial Representation of Data. Unless a statement to the con¬ 
trary is made, we shall suppose henceforth that the values in^a given 
set of data (x,-,!/*), where i = 0, 1, 2, ..n, are equally spaced. If the 
spacing interval is hj then 

Xi =» Xo + X2 = Xo + 2/?-, . . . , Xn = Xq + Tlh, 

We pose the problem of representing the data by some formula y = f(x), 
which for x — xq + kh yields yk = /(xo + kh). We shall frequently write 
fk for pk. 

We observed in the preceding section that whenever the rth differences 
of the ys are constant, then all differences of order higher than r vanish. 
In this event formula (6-4) yields 


Vo + 




where the binomial coefficients 



are defined by 


(M) 


0 k{k ~ l)(k - 2) ... (fc - r + 1) 
* ~ rl 


(7-2) 


Since the x,* are spaced h units apart, 

Xk Xo + kh, ib =« 1, 2, ..., n, 

Xfc — Xo 

SO that k =» —-- (7-3) 

h 

Now th« expression (7-2) is a polynomial of degree r in k. Therefore, on 
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eubstituting in (7-1) for k from (7-3) we obtain a polynomial of degree r 
in X*. When like powers of Xk are collected, (7-1) talxes the form 

yjk «= Oo + aiXk + 02x1 H-h (7-4) 

Accordingly, the polynomial in x, 

y{x) ^ 00 + a\x + -1-1- (7-5) 

assumes the values yu when we set x « Xfc. Thus, when the rth differences 
of the yh are- constant and the Xk are equally spaced, the polynomial (7-5) 
represents these data exactly. 

It is easy to prove a converse to the effect that the rth differences of 
the polynomial (7-6) are constant. It would suffice to show that the 
first difference Ly{x) = y{x + /i) — y{x) formed with the aid of (7-5) 
is a polynomial of degree r — 1, for if differencing a polynomial once re¬ 
duces its degree by 1, r successive differencings would yield a polynomial 
of degree 0, that is, a constant.* 

When rth differences in a given set of data are not constant but differ 
from one another by negligible amounts, the polynomial (7-5) represents 
the data approximately. 

Example: The set of data and the forward differences tabulated below suggest that 
these data can be represented by a cubic polynomial ^ “ oo -f atx oax* + aax* if 
two-decimal accuracy is sufficient. 



* We leave it to the reader to show that Ay{x) is, indeed, a polynomial of degree r — 1. 
The result is analogous to the theorem that the derivative of a polynomial of degree r 
is a iK)lynomial of degree r — 1. The expression Ay y[x -f h) — y(x) save for the 
factor X/hiB the difference quotient used in defining the derivative. 
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Tht® Oi in this polynpttual can be determined with the aid of formula (7-4) 

by using (7-1) trith r 3 and by taking 

I/O «» 2.105, Ayo « 0.703, aVo "* 0.103, A®i/o « 0.081. 

Since such calculations present no interest, we do not give them here. It is more sensible 
to determine the o» by the method of least squares of Sec. 11. 

PROBLEMS 

1. Given the table: 


X 

19 1 

20 

21 

22 

23 

24 

25 

y 

81.00 1 

90.25 

100.00 

110.25 

121.00 

132.25 

144.00 


Compute second forward diflferonces, and represent the data by y »» Oo 4* 0i3: + OiX^. 
Determine <zo, Ui, 02 so that the polynomial passes through (a) the fiist three points, 
(b) the last three points. 

8. Discuss the calculation of the t/k in Prob. I from (5*4) and (5-11), 

8. Newton’s Inteipoktion Fonnuks. When the data {xi,y^), where 
i ^ Of If 2f ,,n, are presented in tabular form, an infinite numl>er of 
analytic relations y = f{x) can be devised siudi that iji — /(xO either 
exactly or approximately. Once a suitable form of /(.r) is determined, the 
formuk y «= /(x) can be used to calculate the ordinates y for xs not ap¬ 
pearing in the table. That is, the formula can be used for interpolation 
or extrapolation. 

The simplest of such formulas is a linear relationship based on the 
assumption that the valuas of y in the interval (xt',Xt 4 .i) can be represented 

by 

y = yi + ^^ —( 8 - 1 ) 

Formula (8-1) is precisely that used in estimating the values of such tabu¬ 
lated functions as logarithms by the process of “interpolation by pro¬ 
portional parts.” 

More accurate interpolation formulas are based on the assumption that 
the desired value of y can be computed from a polynomial 

y ^ ao + aix + a2X^ + * • • + ( 8 - 2 ) 

in which m + 1 coefficients arc so chosen that m + I pairs of tabulated 
values (Xi.yx) satisfy (8-2) exactly.^ 

In the preceding section we saw that when the data are represented by a 

^ These m 4 1 pairs may include the entire set of given values (xi,y»), or they may 
be a subset eo che«eu that \x - xd is as small as possible. 
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polynomial of degree w, then all forward differences of order higher than 
m vanish. Accordingly, formula (6-4) yields 


Vk 


m - 1) , 

2/o + fc A^o H-- A^2/o + * 


• + 


fc(fc — 1) ... (fc — m + 1) 

m\ 


(S^) 


and, since the x, are equally spaced, a:* => Xo 4- hh, so that 


- Xo 


On inserting this value of k in (8-3) we get 


Xk-Xo , (Xk 

y* = yo 4-;— Ayo 4- 


a^o)(a^ifc - Xp-h) 

2\h^ 


A^yo 4- ■ 


4- 


{Xk - Xo){Xk - Xq - /l) . ■ ■ (xt - Xq - wife 4- 

m\hr 


A’”yo. 


(8-4) 


This relation is satisfied by m 4" 1 pairs of the tabulated values. If 
we assume that the value of y corresponding to an arbitrary x can be 
oljtuincd from (8-4) by replacing Xk by x, we get the formula 


v(-r) = 1/0 4 - 



Ayo 4- 


(x — xo)(x - Xo - h) 


A^yo 4-• • • 


-b 


(x - Xo)(.r - X() - h) ... (x - Xo - mh 4- h) 


A^yo 


(8-5) 


known as Newton'f^ fonvard-diffcrcnce mtcrpolaiion formula. This formula 
can, of course, he used fi)r either interpolation or extrapolation. 

By replacing {x — uo)//i by a dimensionless variable A" which represents 
the distance of x from X(, in units of //, we get from (8-5) 


Px 


Vo + ^ Ayo + 


AX Y - 1) 
2! 


A^Vo 4- 


, A'(X- 1) ... (X-m4-l) „ 

4--;-A'-yo, 

ml 


(8-6) 


where X « (.r — Xo)/h and — y(xo + hX) — y{x). 

A similar calculation based on the use of (G-11) yields Newton's backward^ 
difference interpolation formula 


X(X + 1) ^ 

l/n+jsr ==' Vn + X Vyn d- - - V^yn + * 


, X(X+l)...(X + m~l)_ 

+-;- ^Vn (8-7) 

ml 
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where 



so that a; «* + hX, 


and Vn^x « y{xn + hX) « y{x). 

When the data cannot be represented by a polynomial, the right-hand 
members of (8-6) and (8-7) are infinite series involving differences of all 
orders. 

Formulas (8-6) and (8-7) can be used to compute derivatives of tabu¬ 
lated functions. Thus, on differentiating successively (8-5) with respect 
to X and setting x =« xo in the result, we get 




y"(.XQ) = — ^A^2/o - A 2/0 + — A 2/0 - - A 2/o H-) 

y'"ixo) = ^ ^A®yo - ^ A^2 /o + ^ aVo-^ 


( 8 - 8 ) 


y^''(xo) = ^ (A^yo - 2A* yo ^-)• 

Formulas (8-8) sliould be used with caution because even when y == /(x) 
is well represented by the polynomial P(x), the derivatives of /(x) may 
differ significantly from those of P(x). 

Example: Using the data given in the Example in Sec. 7, determine an approximate 
value for the y corresponding to x * 2.2. 

First, let y be determined by using only the two neighboring observed values (hence, 
m « 1). Then, xq * 2, j/o ® 2.808, Ayo » 0.800, and X « (2.2 — 2)/I » 0.2. Hence, 

y « 2.808 -h 0.2(0.806) « 2.969, 

which has been reduced to three decimal places because the observed data are not given 
more accurately. This is simply a straight-line interpolation by proportional parte. 

If the three nearest values are chosen, rrj =» 2, jo “ 1, Vo *“ 2.106, Aya « 0.703, 
aVo « 0.103, and X « 2.2 - 1 «« 1.2. Then, 


y - 2.105 -I- 1.2(0.703) +^--^(0.103) = 2.961, 
correct to three decimal places. 

If the four nearest values are chosen, m «» 3, xo “* I, j/o ■» 2.105, Aye 0.703, 
A*yo 0.103, A*yo "® 0.081, and X ** 1.2. Therefore, 


y - 2.105 + 1.2(0.703) + - -j-- (0.103) + (0.081) . 2.968, 

2 D 


correct to three demmal places. 
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PROBLEMS 


1. Compute with the aid of formulas (BS) and (8-7) the approximate values of p 
corresponding to z » 6.6 from the data of the Example in Sec. 7. Use two and three 
neighboring values. 

2 . Extrapolate the value of y (or x » 8.2 from the data in the Example of Sec. 7 with 
the aid of (a) formula (8-6), (b) formula (8-7). Use m «• 2. 

S. Compute j/'(l) and from the data of the Example of Sec. 7 with the aid of 
( 8 - 8 ). 


9. Lagrange’s Interpolation Formula. The interpolation formulas de¬ 
veloped in the preceding section apply only when the given set of x* is 
an arithmetic progression. If this is not the case, some other type of 
formula must be applied. 

As in Sec. 8, select the m + 1 pairs of observed values for which \x — x^\ 
is as small as possible, and denote them by where f = 0, 1, 2, ., 

Let the mth-degree polynomials PtCx), where A* = 0, 1, 2, ..m, be de¬ 


fined by 


(x - Xq)(x - Xx) . . . (x - Xm) 
X - Xjc 


n (a: - x^)■ 


Then, the coefficients AkOt the equation 


t^k 


y =* 


Z ^kPkix) 


k^O 


can be determined so that this equation is satisfied by each of the m 4* 1 
pairs of observed values (xt,yt). For if x = x*, then 


since P*(Xt) 


Ak 


Vk 

Pkixk)* 


0 if i 9 ^ k. Therefore, 


A VkPkix) 

y -3- > - 

kti> PkiXk) 


(9-2) 


is the equation of the mth-degree polynomial w^hich passes through the 
m -f 1 points whose coordinates are (x»,?/J. If x is chosen as any value 
in the range of the Xi, (9-2) determines an approximate value for the 
corresponding y. 

Equation (9-2) is known as Ixigrange^s mterpolation formula. Ob¬ 
viously, it can be applied when the x* are in arithmetic progression but 
(8-5) is preferable in that it requires less tedious calculation. Since only 
one mth-degree polynomial can be passed through m + 1 distinct points, 
it follows that (8-5), or its equivalent (8-6), and (9-2) are merely different 
forms of the same equation and will furnish the same value for y. 
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V 

10 

16 

22.5 

33.75 

50.625 

75.987 

p 

0.300 

0.675 

1.619 

3.417 

7.689 

17.300 


apply lia.graQge'8 formula to find the value of p corresponding to » ■» 21. 

If the two neighboring pairs of observed values are chosen so that m »• 1, 


P 


« 0.675 


21 - 22.5 
15 - 22.5 


-f 1.519 


21 - 15 
22.5 - 15 


1.350, 


eorrect to three decimal places. 

If the three nearest values are chosen so that m « 2, 


^ ^ (21 - 15)(21 ~ 22.5) , ^ (21 -- 10)(21 - 22.5) 

V » (J.o---r u.o7o----— 

^ (10 - 15)(10 - 22.5) (15 - 10)(15 - 22.5) 


correct to three decinxal places. 


-f 


(21 - 10)(21 - 15) 
^*^^^(22J^0)‘(22.5 - 16) 


1.323, 


PROBLEMS 

1. Using the data of the Example in Sec. 9, find an approximate value for p vhon 
p * 30. Use rn w 1 and m » 2. 

2. Use m a* 1, 2, and 3 in formula (8-6) to find an approximate value of 0 when 
i w 2.3, g^ven 


t 

0 

1 

2 

3 1 

4 

5 

6 

7 

8 

e 

60.00 

51,66 

44.46 

38.28 

32.94 

28.32 

24.42 

21.06 

18.06 


8. Given the data 


X 

0.16 

0.4 

1.0 

2.5 

6.25 

15.625 

y 

2 

2.210 

2.421 

2.661 

2.929 

3.222 


find an approximate value of y corresponding to x « 2. Use formula (9-2) with w « 1 
and mm2. 

4. Given the data 


C 

19 

20 

21 

22 

23 

24 

25 

H 

81.00 

90.25 

100.00 

110.25 

121.00 

132.26 

144.00 


find an approximate value of ff when C *• 21.6. Use formulas (8-6) and (S-7) with 
m 2, and 3. 
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10, Eiilipilieal Fonutilas* A given set of discrete data can be represented 
anal 3 rtically in infinitely many ways. Such analytic representations are 
called empirical formulosj and the choice of the functional form for an 
empirical formula ordinarily depends on the use to be made of the formula. 
Thus, if a given set of data is to be represented by a function f{x) which 
enters in the differential equation 

L{u) ^fix), 

the form of fix) may well depend on the ease with which this equation can 
be solved. For some types of differential operators L it may be wise to 
take fix) as an algebraic polynomial, in others as an exponential, and so 
on. Because of the commonness of algebraic and trigonometric poly¬ 
nomials in applications, we confine our discussion of empirical formulas 
primarily to these two types. 

The first step usually taken by an experimenter in appraising a set of 
observed values (xi^yi) is to plot them on some coordinate paper and draw 
a curve through the plotted points. If the points (x„2/f), when plotted 
on a rectangular coordinate paper, lie approximately on a straight line, 
he assumes that the equation y ^ mx + h represents the relationship. 
To determine the constants m and 6, the slope and the y intercept may be 
read off the graph or they may be calculated by solving two linear equations 
for rn and i!> got by substituting the coordinates of two judiciously chosen 
points on y - mx + b. 

If the plot of points on a logarithmic coordinate paper indicates that they 
lie on a straight line, the desired relationship has the form 

y == ax’”, 

for on taking logarithms, we get 

log y = log a + m log x, 

and if coordinate axes X, Y are marked so that log y ^ Y and log x - X, 
we get a linear equation 

F = log a + 7nX. 

Again the constants a and m can be either read off the graph or computed 
by solving a pair of linear equations for m and log a. 

Similarly, the data can be represented by an exponential function 

y « alO’”* 

if the values (xi^yd when plotted on a semilogarithmic paper fall on a 
straight line, for on taking logarithms to the base 10, we get 

log y « log a + mx, 

which is linear in log y and x. 
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When non© of these simple functional relationships fits the data, one 
may determine, with the aid of Sec. 7, if the data can be fitted by a poly¬ 
nomial. It aiiould be stressed, however, that ordinarily the choice of an 
empirical formula is governed by whatever uses are to be made of it. Once 
a formula is chosen, the parameters entering in it (such as the coefficients 
in the polynomial representation) can be determined by imposing some 
criterion for the goodness of fit of the data by the chosen function. The 
method of least squares, presented in the next section, provides one of the 
most commonly used of such criteria. 


PROBLEMS 

1. Plot the following data on a rectangular, logarithmic, or semilogarithmic paper to 
determine the approximate functional relationships between y and x. 


X 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

V 

X 

6 

1' 

1 ^ 

1 

.6 

1 ^ 

6 

M 

6.4 

3 

7 

4 

7.5 

5 

8.2 

6 

8.6 

7 

9 

8 

9.5 

1 ^ 

V 

2.5 

3.5 1 

4.3 

5 

5.6 

6.2 

6.6 

7.1 

7.5 


X 

1 

2 

3 ! 

4 I 

5 

6 

7 

8 

y 

0.5 

0.8 

1.2 

1.9 

3 

4.8 

7.5 

11.9 


2. Verify that the data in Probs. 2, 3, and 4 in Sec. 9 may be approximated by the 
following types of functions: 6 « y « ox”*, ff ^ oo + aiC -f respectively. 

Determine the parameters graphically or analytically. 

11. The Method of Least Squares. We saw in Sec. 7 that the m + \ 
coefficients in the polynomial 

2/ = Oo + aix H-h (11-1) 

can always be determined so that a given set of m + 1 points 
where the a:s are unequal, lies on the curve (11~1). When the Xi are equally 
spaced, the desired polynomial is determined by the formula (8-5) and, 
in the more general case, by (9-2). 

When the number of points is large, the degree m of the polynomial 
(11-1) is high, and an attempt to represent the data exactly by (11-1) 
not only is laborious but may be foolish, for the experimental data in¬ 
variably contain observational errors and it may be more sensible to rep¬ 
resent the data approximately by some function y =« f{x) which contains 
a few unknown parameters. These parameters can then be determined 
so that the curve y « /(x) fits the data in ‘^the best possible way/' The 
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criteria as to what constitutes *'the best possible way"' are, of course, 
arbitrary. 

For example, we may attempt to 
fit the set of plotted points in Fig. 12 
by the straight line 

y = ai + aa:c 

and choose the parameters ai and az 
so that the sum of the squares of 
the vertical deviations of the plotted 
points from this line is as small as 
possible. 

More generally, if we choose to 
represent a set of data (:r„y»), where 

7 1, 2, ..., n, by some relationship y == /(x), containing r unknown 

parameters Ui, 02 , ..., ctr, and form the deviations (or the residuals^ as 

they arc also called) ... m 

^ v,=fix,)-y„ (11-2) 



the sum of the squares of the deviations 

s s x; v? = 2 {/{xi) - Vif 

»«i t—i 


(11-3) 


is clearly a function of ai, 02 , ..Or. We can then determine the os so 
that S is a minimum. 

Now, if S(ai,a 2 .- • • ?«r) is a minimum, then at the point in question 


dS 

dS 

— = 0 , 

— 

aai 

daz 


dS 

dUr 


(11-4) 


The set of r equations (11-4), called normal equationsy serves to determine 
the r unknown as in y - fix). This particular criterion of the ‘^best fit^' 
of data is known as the principle of least squaresj and the method of de¬ 
termining the unknown parameters with its aid is called the method of least 
squares. It was introduced and fully developed by Gauss ^ when he waa 
a youth of seventeen! 

We indicate the construction of the normal equations first by supposing 
that y — fix) is a linear function 

y — Cl + azx. ( 11 - 6 ) 

‘ The criterion of least squares plays a fundamental role in the approximation of a 
suitably restricted function f{x) by a linear combination of orthogonal functions. As is 
shown in Chap. 2, Sec. 23, the partial sums of Fourier series give the best fit in the sense 
of least squares. It should be noted, however, that the polynomials giving the best 
fit to/(x) in the sense of least squares, in general, are not the partial sums of Maclaurin^s 
or Taylor’s series for/(x). 
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m 

The leeiduals (11-2) for (11-6) are 

Vi » («! + oax.) - yi, 

so Utat jS * 

= (ai -f- a2Xi — yi)* -f- (oi -f- oaXa — ya)* 

+ •••-)- (Oi + OaXn — Vn)*. 

On differentiating S with respect to «i and oa, we deduce two equations: 


— = 2(oi -f- OaXi - yi) + 2{ai + aaxa - y-S) 
doi 

+ ■ • -+ 2(01 + a2x„ — y„) =■ 0 , 

dS 

— = 2 xi(ai -)- OaXi - j/i) -f 2x2(0, + O2X2 - 1/2) 

802 

-I-h 2x„(0i + 02 l„ - Vn) = 0. 

If we divide out the factor 2 and collect the coeflBcients of a, and 02 , we get 

+ { S A ) «2 = 2 yi> 

'•-.1 / 1=1 

( 11 - 6 ) 

( X) a:,) 0, + ( xf\ 02 = Xij/,. 

\»=»1 / / imvl 

These equations can be easily solved for ai and a 2 . 

Emmple 1. We illustrate th€» use of Eqs. (11*^) by calculating the coefficients in 
a m -f oaa; to 6t the following data: 


X 

1 

2 

3 1 

4 

y 

1*7 

1.8 

2.3 

3.2 


In this case n <« 4, and since 

4 

D J, = 1 + 2 + 3 + 4 - 10, 

4 

'Zvi’" 17 + 1-8 + 2.3 + 3.2 - 9, 

t»l 

4 

i;x?-l+4+9 + 16-30, 

4-1 

4 

2] i<y< - 1.7 + 2(1.8) + 3(2.3) + 4(3.2) - 25, 
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the Sjrstem (11*6) reads 

4ai -f lOos *■ 9, 
lOai *f 3002 - 25. 


Solving for Oi and 02 , we get oi *» 1, 02 » »o that the desired straight line fitting the 
data in the sense of least squares is 1 / « 1 4- 


We suppose next that y » f(x) is a polynomial 
V - ai + a2X + 4-(- 

« Z) 

3^1 


The residuals Vi this time are 


Vi « X * - Vu 

J-1 


(11-7) 


( 11 - 8 ) 


Since 


s = 5D»?, 


Eqs. (11-4) can be written as 


dS A 

~ = 2Z‘-.—= 0 , 

oCLk M 


k = 1 , 2 , 


From (11-8), 


— - r‘-‘ 

n » ^ 

dajfc 


(11-9) 


so that, on dividing out the factor 2, we can write the normal equations 
(11-9) as 

n 

E = 0. (11-10) 

The substitution from (11-8) in (11-10) yields 

2Z (s - y.) = 0. 

and on collecting the coefficients of the ay, we get a set of r finear equations 
22(2 o,- = X) xf~ V. fc = 1, 2, ..., r, (11-11) 

JW Vt—I / 1—1 

for Ui, 02 , ♦ *., a^. 

We illustrate the use of these equations by two examples* 
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Mmmpk 2. JM the data ib Example I be fitted by p «> ai oia: + ajar*. Then 

• ai + oaxi + Oiaf - Vi 


and 

The normal equations 


dt»t , dvi dVi ^ 

— ** 1 — «. Xi, — ** xf. 

dai dot das 

f;vi—-0, *-1,2,3, 

»-S dOk 


S (“1 + + ^ 3^4 ““ J /*) *1 * 0 , 


(<^1 + «2^' *f asxf - y,)x, - 0, 

♦—i 


2 (0i 4- OflXi + asxf - yt)4 - 0. 

i^i 

If the coefficients of the aj are collected and the normal equations put in the form 
(11-11), one obtains the three equations 




4 (S 

4 

” Zv., 

t-1 



4 

4 

* 23 



4 ( 22 ^ 4 ) 

- E !?»<• 

>-i 

4-1 

«1424344»10, 

53 a: J * 1 4 

4+9 + 16 

4 

* 1.7 4 3.6 4 6.9 412 

.8 ** 25, etc. 



Now, 


4-1 

The equations become 

4ai 4 lOoj 4 30oj - 9, 

lOai 4 3002 4 IOO 03 » 25, 

30oi 4 lOOo* 4 35408 « 80.8; 

ind the solutions are oi ** 2, 02 * —0.5, os «• 0.2. 

ExampU 3. Let us apply the method of least squares to fit the data 


X 

1 

2 

3 

4 

5 

6 

7 

8 

y 

2.105 

2.808 

3.614 

4.604 

5.857 

7.461 

9.467 

11.985 


|>y the polynomial^ *■ oi 4 4 Oix* 4 aup*. 
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In this case n •• 8 and Bqa. yield fovir normal equatkma obtained by wtting 

fc — 1, 2, 3, 4. They are 

8«i + (S *<) “» + “* (,^^) “* “ 

(s*‘) +(.4=^) “»+(s*^) *** “ 

(Ex?) oi + (E^) “1 + (E*}) 0* + (E**) «< - 
( E I?) ai + (E®f) <H + (E^) + ( E**) o< - E4w- 

From the form of the coefficients of the a*, it is seen that it is convenient to make a 
table of the powers of the x^, and to form the sums "Zx^i and Zx\y% before attempting to 
write down the equations in explicit form. 


B 

X? 

^8 

Xi 

Xi 


4 

1 

1 

1 

1 

1 

1 

2 

4 

8 

16 

32 

64 

3 

9 

27 

81 

243 

729 

4 

16 

64 

256 

1,024 

4,096 

5 

25 

125 

625 

3,125 

15,625 

6 

36 

216 

1,296 

7,776 

46,656 

7 

49 

343 

! 2,401 

16,807 

117,649 

8 

04 

512 

4,096 

32,768 

262,144 

2x{ 36 

204 

1,296 

8,772 

61,776 

446,964 


B 

Vi 

xm 

x,y. 

r-.. 

x\yi 

1 i 

2.105 

1 

2.105 

2.105 

2 

2.808 


11,232 

22.464 

3 

3.614 

10.842 

32.526 

97.578 

4 


18.416 

73.664 

294.656 

5 

5.857 

29.285 

146.425 

732.125 

6 

7.451 

44.706 

268.236 

1,609.416 

7 

9.467 

66,269 

463.883 

3,247.181 

8 

11.985 

95.880 

767.040 

6,136.320 

Zx\yt 

47,891 


1,765.111 

12,141.845 
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Wben the vahiee gipen ia the tablse are inserted, the normal equations become 
8oi + 36o* + 204oj + 1,29604 - 47.891, 

36oi + 204OS + l,296oj + 8,772o4 - 273.119, 

20401 + l,296aj + 8,772os + 61,77604 - 1,766.111, 
l,296oi + 8,77208 + 61,776os + 446,964o4 - 12,141.846. 

The solttticms are 


ai - 1.426, 02 - 0.693, as « -0.028, 04 - 0.013. 

Therefore, the equation, as determined by the method of least squares, is 
y « 1.426 -f 0.693X - 0.028x^ + O.OlSte*. 

The nonnal equations (11-11), corresponding to the polynomial repre¬ 
sentation of data, are linear in the coefiScients a^. They need not be linear 
in the unknown parameters if the function y — fix) is not a polynomial 
in X, In this event the solution of the system (11-4) may prove difficult, 
and one may be obliged to seek an approximate solution by replacing the 
exact residuals (11-2) by approximate residuals which are linear in the 
unknowns. This is accomplished by expanding y - f{x), treated as a 
function of ai, a 2 > • • •> in Taylor’s series in terms of — d* s Aa*, 
where the are approximate values of the ai. The values of <l, may })e 
obtained by graphical means or by solving any r of the equations y, ~ fix%). 
Hie expansion gives 

y » fix, ai, . . ., Ur) ^ fix, + AOi, dr+ AUr) 


where 


^ df 

' fix, di, dr) + ^ AUfc 
k^i oak 

1 d"/ 

+ r : 2 -/ - ^^3 ^ 


2\j,k^i d&jddk 


( 11 - 12 ) 


d/ d/ 

-s - 

ddk dak 




d^f 


ddj dak daj dak 


etc. 


ia/*«07 

ak-mak 


Assuming that the dt are chosen so that the Aa» are small, the terms of 
degree higher than the first can be neglected and (11-12) becomes 




E — 

Jfc«i ddk 


Aa*. 


The n observation equations are then replaced by the n approximate 
^ equations 


df 
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If (11-13) is used, the residuals Vi will be linear in the Aauj and henee the 
resulting conditions, which become 

BS 

——- = 0, fc -» 1, 2, ..r, (11-14) 

d(Aafc) 

also will be linear in the Aak, Equations (11-14) are called the normal 
equations in this case. 

We illustrate the use of Eqs. (11-14) in Example 4. 

Example 4. We seek to determine the constants k and a in the formula 6 ka* 
chosen to represent the following data: 



1 

2 

3 

4 

0 

51.66 

44.46 

38.28 

32.94 


The determination of k and a in this problem can be reduced to the solution of two 
linear equations, for if we write 0 *• ka^ in the form 

0 » klCf*^ 

then on taking logarithms to the base 10, we get 

log 0 « log I: -f bt. 

Setting log ^ and log ■* X, we get 

+ (11-15) 

which Is linear in K and h. These constants can be determined by the procedure de¬ 
scribed above, which leads to the solution of a pair of linear equations.' 

To illustrate the use of formulas (11-12) to (11-14), we follow a more laborious route 
W'hich gives an approximation to the original equation. 

When the values recorded in the table are plotted on semilogarithmic paper, it is 
found that A; « 60 and a « jQ-ooes ^ 0.86, approximately. This suggests using A:© “ 00 
and oo “* 0.9 as the first approximations. The first two terms of the expansion in 
Taylor's series in terms of AA. =* A: — 60 and Ao « o — 0.9 are 

0 « 60(0.9)' -f AA: + Aa 

\aA:/a—o» \aa/o«o.9 

« 60(0.9)' + (0.9)' Ak -f 60i(0.9)'-~i Aa. 

If the values (UM are substituted in this equation, four equations result, namely, 

Bi - 60(0.9)'‘ + (0.9)'* Ak + 60<.(0.9)'‘“‘ Aa, i - 1, 2, 3, 4. 

The problem of obtaining from these four equations the values of AA: and Ao, which 
furnish the desired values of is precisely the same as in the case in which the original 
equation is linear in its constants. The residual equations are 

Vi - (O-g)** Ak + eowo.o)"'* Aa + 60(0.9)“ - Bi, i - 1, 2, 3, 4. 

' However, the approximation obtained by this means does not give an approximatiois 
to the original equation in tlie sense of least squares. 
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Therefore 


NmCBElCAL AKAtraiS 


■s » Z - Z l(0-9)‘‘ A* + 60«0.9)*<-‘ AO + 60{0.9)‘< - »<J*, 

imml ill). 


[CHAJP. 9 


aad the normal equations 


become 


and 


dS 

d(m 


«0 and 


dS 

d(Aa) 


-0 


2 E 10-9^ Aifc -f 6(W<(0,9)<<-“^ AO + 60(0.9)*» - tf4l0.9*< - 0 

i-I 


2 D [0.9'* A^ 4- 60i^(0.9)'<~‘ Ao + 60(0.9)'^ - « 0. 


When these equations are written in the form 

p All 4 g Ao « r, 

with all common factors divided out, they are 


and 


Z (0.9)"‘ Ak + 60 Z t.(0.9)*‘‘-‘ AO - £ «<(0.9)‘‘ - 60 2 (0.9)"< 

9aMi ♦■■I ••>) 

4 4 4 4 

Z W0.9)*‘<-‘ Ak + 60 Z ^(0.9)*'<-* AO - Z 9<<<(0.9)'--i - 60 £ «.(0.9)*‘<-‘. 

««■) ♦■■I ♦■•I *»*1 


As in Example 3, the coefficients are computed most conveniently by the use of a table. 


Bi 

B 

2 

3 

4 

Totals 

(0.9)*' 

0.9 

0.81 

0.729 

0.6561 


(0.9)*“ 


0.6561 

0 531441 

0.43046721 

2.42800821 

t<(0.9)*“-i 


1.458 

1.77147 

1.9131876 

6.0426576 

«f(0.9)’“-* 

1 

3.24 

6.9049 

8.503056 

18.647956 

(e<){0.9)“ 

46.494 

36.0126 

27,90612 

21.611934 

132.024654 

(9<fc)(0.9)“-‘ 

51.66 

80.028 

93.0204 

96.05304 

320.76144 


Substituting the values of the sums from the table gives 


and 


2.42800821 Ait 4 362.559456 Aa » 132.024654 - 145.6804926 
d.0426fi76 Ah 4 1,118.87736 Aa » 320.76144 - 362.559456. 
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Reducing all the numbera to four decimal places gives the following equations to solve 
for Ak and Aa: 

2.4280 Ak + 362.5595 Aa » -13.6658, 

6.0427 Ak + 1,118.8774 Aa « -41.7980. 

The solutions are 

Ak -» -0.238 and Aa « -0.036. 

Hence, the required equation is 

e « 69.762(0.864)*. 

PROBLEMS 

1. Apply the method of least squares to find the constants in j/ « ai -f O/^x -f- ajijc® 
to fit the data 


X 

1 

2 

3 

4 

5 

6 

V 

3.13 

3 76 

6.94 

12.62 

20.86 

31.63 


2. Determine by the method of least squaras the constants a and n in p ** Of»" to 
fit the following data by writing the equation in the form 

log p « n log V -f log a. 


V 

10 

15 

22.5 

33.7 

1 50.6 

75.9 

V 

0.300 

0 675 

1.519 

3.417 

7.689 

17.300 


Hint: Set log p »» y, log v x, and determine the constants in the resulting linear 
equation. 

8. Compare the result of Example 4 with the calculation of the constants in (11-16). 

12. Harmonic Analysis. The problem of representing a suitable periodic 
function in a trigonometric series was considered in some detail in Chap. 
2. In this section we give a brief discussion of the problem of fitting a 
finite trigonometric sum to a set of observed values Let the set 

of observed values 

(^2«—l>2/2n—1)» (^2n>2/2n)> 

be such that the values of y start repeating with t/ 2 n (that is, y 2 n «= yoj 
2 / 2 n-fi = Vu etc.). It will be assumed that the x, are equally spaced, that 
xq = 0, and that X 2 n = 27r. [If Xq 9 ^ 0 and the period is c instead of 
2 t, the variable can be changed by setting 

2ir 

= (xi —* 3 ^ 0 ) • 

c 

The discusmon would then be carried through for «,■ and j/,- in place of the 
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Zi and Pi used below.] Under these assumptions 


[CSAF. 9 


Xi a» 



iw 

n 


The trigonometric polynomial 

n n—l 

2^ » ilo + kx + ^ Bk sin kx (12-1) 

Jkw.! k^l 

contains the 2n unknown constants 


*^0; ■^2f • • •> Bi, B2 j . . Bn^lf 

which can be determined so that (12-1) will pass through the 2n given 
points (xi^Vt) by solving the 2n simultaneous equations 

n n—l 

Vi ^ Aq + Ak cos kxi + t “ 0, 1, 2, ..2n 1. 

k^l km.1 


Since Xi «» tV/n, these equations become 

” ikr ikr 

Vi Ao+ Za AkCOB -h 2^ Bk sin — t 

*•1 w ^ 


i « 0, 1, 2, 2n - 1. (12-2) 


Hie solution of Eqs. (12-2) is much simplified by means of a scheme 
somewhat similar to that used in determining the Fourier coefficients, 
Multiplying both sides of each equation by the coefficient of Aq (that is, 
by unity) and adding the results give 


A A' /"A' 

53 Vi « 2nAo + 53 ( 23 cos — ) + 23 ( 23 sin — ) Bk. 

A:«»l ^ »ts»0 ^ ' t"»0 


n / 


It can be established that (cf. Example 1, Sec. 17, Chap. 2) 


and 

Therefore, 


2w-l 

ikw 


E 

cos — 

= 0, 

M 

n 


2n-l 

ikv 


r 

sin — 

= 0, 

f«»0 

n 



2n-] 

2nAo « 53 Vi^ 
♦-•0 


fc = 1, 2, ...,n 
A = 1, 2, ..., n — 1 


(12-3 


Multiplying both sides of each equation in (12-2) by the coefficient of A 
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in it, and adding the results, gives 

ijr A/V ikr ijr\ 

2 ^ j/f cos — « 2 ^{ 2 j cos — cos — lAk 

itmO ^ krml \ t.»0 ^ ^ / 

. ikr 0V\ 

+ I Zu sm — cos — 1 Bk 

r • ... . « A ib-i \ t~o n nf 

for j « 1, 2, ..n — L But 


ikv ijv 
2 ^ cos — cos — = 0, if k 9 ^ jj 

»—0 


71 


n 


n, 


if & = j, 


and 


^ , ikv ijv 

2^ sm — cos — =» 0 
t«.o n n 


for all values of k. Therefore, 

2n-l 

nAj 53 2/»co8 

,«.o n 


tjTT 


i = 1, 2, ..., n ~ 1. 


(12-4) 


To determine the coefficient of i4„ the procedure is precisely the same, 
but 

ikir 

> . cos — cos iw = 0, if k n, 
n 

== 2n, if =« n. 


Hence, 


2w —1 

2nAn = 53 

»=».o 


(12^5) 


Similarly, on multiplying both sides of each equation of (12-2) by the 
coefficient of Bk in it and adding, one finds that 

tjV 

= 2 2 /. sin —• j = 1, 2, ..., n - 1. (12-6) 

.-0 n 

Equations (12-3) to (12-6) give the constants in (12-1). A compact 
schematic arrangement is often used to simplify the labor of evaluating 
these constants. It will be illustrated in the so-called “6-ordinate’^ case, 
that is, when 2n == 6. The method is based on the equations that deter¬ 
mine the constants, together ^vith relations such as 

IT (n •— 1 )t (n + l)7r (2n — l)7r 

sin - « sin-* — sin-== — sin -- 

n n n n 


7C 

cos - 
n 


(n — 1 )t 

— cos-- 

n 


— cos • 


(n 4" l)ir 


COS 


(2n — l)Tr 


n 
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Six-ordinate Scheme, Here, 2n « 6; the given points are (xi,yi), where 
Xi * tV/3 {i « 0,1, 2,3, 4,5); and Eq. (12-1) becomes 

y ^ Aq + Ai 008 X + A2 COB 2x + A3 cos 3x -f sin x + sin 2®. 

Make the following table of definitions: 


I/O yi 1/2 

VO Vi 

v;o vt>i 

yz Vi 3/s 

V 7 

Wt 

Sum . . Vo vi 

Po Pi 

ro n 

Difference {M^o wh m >2 

Qi 



It can be checked easily that Eqs. (12-3) to (12-6), with n = 3, become 

6.4o - Po + Pi, 3^1 = ro + 3^42 = Po HPi» 

V3 V3 

6i43 « ro ~ «i, 3i?i = — 3i?2 = — gi- 

2 2 

Example: In particular, suppose that the given points are 


X 

1 

0 

IT 

3 

2ir 

3 

■jr 

47r 

3 

5ir 

T 

2tr 

V 

1,0 

1.4 

1.9 

17 

1.5 

1.2 

1.0 


Upon using these values of y in the table of definitions above, 


1.0 


1.1 

1.9 

1.7 


1.5 

1.2 

2.7 

Vi = 

2.9 

i!;2 3.1 

-0.7 

Wi * 

-0.1 

* 0.7 


2.7 


2.9 

-0.7 

-0.1 



3.1 


0,7 

po « 2 7 

Pi « 
gi » 

6.0 
-0 2 

ro ^ —0.7 

n « 0.6 

«1 a* —0.8 


Therefore, the equations determining the values of the constants are 


64o » 2.7 + 6.0 

8.7 

and 

^0 - 

1.45. 

34i ■» —0.7 •— 0 4 «“ 

-1 1 

and 

Ai - 

-0.37, 

342 « 2.7 - 3.0 « 

-0.3 

and 

At — 

-0.10, 

643 « -0.7 + 0.8 « 

0,1 

and 

A, - 

0.02, 

3B, » — (0,6) - 

ji 

0.3V3 

and 

B, - 

0,17, 

Vi 

3B, - —(-0.2) - 
2 

-o.iVs 

and 

Bj •• 

-0,06. 
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Hence, the curve of type (12-1) that fits the given data is 

y — 1.45 — 0.37 cos x — 0.10 cos 2x -f 0.02 cos 3x -f 0.17 sin x — 0.06 sin 2x, 

A convenient check upon the computations is furnished by the relations 

Va 

Ao 4- -dj 4- -<12 4* -dg «« ya and 4- Bg *■ (yi — y^). 

Substituting the values found above in the left-hand members gives 

1.45 - 0.37 - 0.10 4- 0.02 « 1.0 and 0.17 ~ 0.06 - 0.11, 

which check with the values of the right-hand members. 

Similar tables can be conslructed for S-ordinates, 12-ordinate8, etc. 


NUMERICAL INTEGRATION OF DIFFERENTIAL EQUATIONS 


13. Numerical Integration.^ The reader is familiar with the interpreta¬ 
tion of the definite integral / f(x) dx as the area under the curve y ~ /(x) 

Ja 

between the ordinates x — a and x «= 5. This interpretation underlies tlie 
construction of formulas for numerical integration contained in this sec¬ 
tion. 

It will be recalled that if the function /(x) is such that its indefinite 
integral can be obtained, then the fundamental theorem of integral calculus 
provides an easy means for evaluating the definite integral.^ However 
w'hen/(x) does not have an indefinite integral expressible in terms of known 
functions, or w4en the values of /(x) are given in tabular form, formulas 
for numerical integration are generally used to obtain an approximate 
value of the integral. 

Formulas for numerical integration, or mechanical quadrature^ are ob¬ 
tained by replacing the function /(x) specified at a given number of points 
in the interval {a,b) by a polynomial (8-5) or (9-2), depending on whether 
the values of x are equally or unequally spaced. 

If the values of ^ = f(x) are known at m 4- 1 points where i = 0, 
1, 2, ..w, which are spaced h units apart, an approximate value of the 

integral / '^'fix) dx can be computed by substituting in the integrand an 

approximate polynomial representation of y - f{x) given by (8-5) or, 
equivalently, (8-()). We thus get for equally spaced values x,- 

j^ydX =j^^yo + X Ayo + A^o 


+ ••• + 


XiX - 1) ...(X -m+1) 
ml 



dX, 


(13-1) 


' See Chap. 3, Sec. 13. The evaluation of difficult integrals by power series is discussed 
to Chap, 2, Sec. 10. 
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where X is the dimensionless variable defined by 


X — Xo 


and X =« m for 


= *0 + mh. 

K m * 1, formula (13-1) yields 


(13-2) 

(13-3) 


j^y dX =£(yo + X Avo) dX - j/o + “ = yo + ^ (yo + Vi). 


But from (13-2) dX = dx/h^ and on recalling (13-3), we see that this 
formula can be written as 

[ 'ydx == ^(yo + i/i). (13-4) 

Jxo 2 

Since yo is the ordinate oi y ^ f(x) at r == xq and yi is the ordinate at 
X » Xi, the right-hand member in (13-4) represents the area of the first 
trapeaoid sliown in Fig. 13. The choice of m = 1 in the calculations 



leading to (13-4) corresponds to replacing y == /(x) in the interval (xo,xi) 
by the straight line through (xo,yo) and (xi,yi). 

The successive application of (13-4) to intervals (xi,X 2 ), (X 2 ,X 3 ), 
(xn--i,Xn) yields 


dz 


= [ 'ydx + f *ydx+* 

Jxa Jxi 


■ + /*" ydx 

JXrL-\ 


~ ( 2/0 + 2/l) -f - (^1 + 2 / 2 ) H-h (l/n-l 4- 2^n) 

it it 2 

— “ (Vo + 2j/i -f- 2 i /2 -b • • • -f 2yn_i + y„). 


( 18 - 6 ) 
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Formula (13-5) is known as the trapemdal rule, for it gives the vi^e of 
the sum of the areas of the n trapezoids whose bases are the ordinates 
yo> t/iiVa, • • •. Vn- Figure 13 shows the six trapezoids in the case of n = 6. 
If m » 2, (13-1) becomes 

r* /•* r (X® - X) , 1 

lydX ^ 1^ Uo + XAyo + - - - - - A*yo I dX 

= 22/0 -I- 2 Aj/o + - ^ - 2 j A®2 /o 
1 

= 2yo + 2(yi - yo) + - iva - ^Vi + Vo) 

O 

1 4 1 

“ 3 2/0 + - 2/1 + - 2/8, 

f^i h 

or ydx ^ ■-(yo + 4pi + 2 / 2 ). (13-6) 

^xo 3 

Suppose that there are n + 1 pairs of given values, where n is even. If 
these n + 1 pairs are divided into the groups of three pairs with absciv-Has 
^ 2 »fb ^ 2 »+ 2 ) where t = 0, 1, . .(71 — 2)/2, then (13-6) can be applied 
to each group. Hence, 

rXn fx^ rx^ rx^ 

/ ydx^l ydx + \ y dx -h / y dx 

Jxsi Jxt, Jxi Jxn-i 


h h 

- - (yo + 42/1 + ya) + - (ya + 42/3 -1- 2 / 4 ) 

o o 


H-h “ (?/n-2 + 4 yn^l + Pn) 

o 

h 

= - [yo + yn + 4(^1 + ys H-1- yn-i) 

O 


+ 2(7/2 + J^4+-*-+y«^2)]. (13-7) 

Formula (13-7) is known as Simpson's rule with m = 2. Interpreted 
geometrically, it gives the value of the sum of the arenas under the second- 
degree parabolas that have been passed through the points ix 2 ity 2 i)y 
(® 3 .+i<yai+i), and (x 2 ,+ 2 ,y 2 .+ 2 ), where i »= 0, 1, 2, ..(n - 2)/2. 
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If flt «B 3, (13-1) states that 

/»_/«/ X® - X , X* - 3 X® + 2 X - \ , 

jT 1/ dX - ^yo + X Ayo H- - -A Vo H- - -A®yo j 

9 /9 9\ , /27 9 3\ , 

= 3yo + -Ayo + (^--)Ayo + (--- + -)Ayo 

9 9 

“ 3yo + - (Vi - Vo) + - (y 2 - 2yi + yo) 

2 4 

3 

+ ~ ( 2/8 ~ 3^/2 4* 3^/1 — yo) 

o 

3 

- ( 2/0 + 3yi + 3 i/2 + yz)f 

o 

/■*! Sh 

or ydx —(yo + Syi + 3^2 + ys)- (13-8) 

Jxo 8 


If n + 1 pairs of values are given, and if n is a multiple of 3, then (13-8) 
can be applied successively to groups of four pairs of values to give 

£ fi 3^ 

ydx ^ —lyO + yn + 3(yi + y 2 + 1^4 + 2/5 H-+ yn --2 + yn~l) 

8 

+ 2(y3 + ye H h yn-s)]- (13-9) 

Formula (13-9) is called Simpson's rule 'with m ~ Z. It is not en¬ 

countered so frequently as (13-5) or (13-7). Other formulas for numerical 
integration can be derived by setting m = 4, 5, ... in (13-1), but the three 

given here are sufficient for ordinary purposes. In most cases, better 

results are obtained by securing a large number of observed or computed 
values, so that h will be small, and using (13-5) or (13-7). 


Example 1. Using the data given in the Example of Sec. 7, find an approximate value for 
j^ydx. 

Using the trapezoidal rule (13-5) gives 

^ y dx » H(2.105 -f 5.616 4- 7.228 + 9.208 -f 11.714 4- 14.902 4- 9.467) *» 30.120. 


Using (13-7) gives 

j\dx - H[2.m -b 9.467 4- 4(2.808 4- 4.604 4- 7.451) 4- 2(3,614 4- 5.857)] ~ 29.989. 
Using (13-9) gives 

^ y di - H[2.m 4- 9.467 4- 3(2.808 4- 3.614 4- 5.857 4- 7.451) 4 - 2(4.604)] « 29.989. 
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If numerical integration is to be used in a problem in which the form of 
fix) is known, the set of values (xitVt) can usually be chosen so that the 
Xi form an arithmetic progression and one of the formulas deduced above 
can be applied. Even if it is expedient to choose values closer together 
for some parts of the range than for other parts, these formulas can be 
applied successively, with appropriate values of ft, to thase sets of values 
for which the form an arithmetic progression. However, if the set of 
given values was obtained by observation, it is frequently convenient to 
tise a formula that does not require that the x,- form an arithmetic progres¬ 
sion. 

Suppose that a set of pairs of observed values where t 0, 1, 

2, ..., m, is given. The points (xi.yi) all lie on the curve whose equation 
is given by (9-2). The area under this curve between x — xo and x « 

is an approximation to the value of f y dx. The area under the curve 

Jxo 

(9-2) is 

/*" 2 / dx = 52 ^ / “ Pk(x) dx, (13-10) 

in which the expressions for the Pkix) are given by (9-1). 

If 7n «= 1, (9-1) and (13-10) give 

f ‘ 2 / dx == —^(x — xj) dx H- ^(x — xo) dx 

Jxo Xo — Xi •'*0 Xi — Xo •'*0 

Xi ■“ Xo 

= — - —(2/0 + 2/i). (13-11) 

A 

Formula (13-11) is identical with (13-4), as would be expected, but the 
formula corresponding to (13-5) is 

\ ydx =* m(x, - xo)(yo + yi) + (x 2 - xi)(yi 4- 2 / 2 ) 

Jxa 

-i- h (x„ - X„_,)(yn-1 + Vn)]- (13-12) 


If m = 2, (13-10) becomes 
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go pi - 4 _ {Xi + Xa)(a| -- a^g) 

Po(xo) L 3 2 


+ XiZ2iX2 


-*o)] 


gi r 

\ixi) L 


a-i “ a^o (a^o + a'2)(a| - x*) 


Va \xl-xl (xo + xi)(3-i - xo) 


Pa(xs) L 

{X2 - Xo)^ r go 


4- XoX2(X2 


+ XoXl(X2 


-*»)] 

- Xo)j 


- (3xi - 2xo - X 2 ) + r (Xo - X 2 ) 
(Xo) Pl(Xl) 


+ (2X2 + Xo - 3xi) . (13-13) 

Pai^a) J 

Formula (13-13) reduces to (13-6) when xi — Xo = 0*2 ~ '^1 ~ The 
formula that corresponds to (13-7) is too long and complicated to be of 
practical importance, and hence it is omitted here. It is simpler to apply 
(13-13) successively to groups of three values and then add the results. 

Example 2. Using the data given in Prob. 3, Sec. 9, find an approximstti value of 


£ .26 

ydx. 

It 


Using (13-12) deternunes 


^6.26 

/ ydx ^ 3^(0.24(4.210) + 0.6(4.631) + 1.5(5.082) -f 3.75(5.590)] « 16.187. 

Jo At 

Applying (13-13) successively to the first three values and to the last three values gives 
1*-^ . (0.84)* r2(1.2 - 0.32 - 1) 2.210(~0.84) , 2.421(2 + 0.16 ~ 1.2)1 


/^•*» (0.84)* r2(1.2 - 0.32 - 1) 2.2 

6 I (*-0.24)(“-0.84) (0.: 


24)(~0 6) 


(0.84)(0.6) 


(6.25)* r2.421(7.5 - 2 - 6.25) , 2 001(~5.25) 


(-1.5)(~5.25) 


(1.5)(-3.75) 

2.929(12.5 4- 1 - 7.5)^ 
(5.25)(3.75) 


PROBLEMS 


Determine the values of ^ y dx by applying (13-5) and (13-7) to the following data: 


xl 234667 
y 2.167 1 3.519 4.198 4.539 4.708 4.792 4.835 
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% Apply formula (13-12) to compute / pdt? from the data of the Example in Sec. 9. 

Jia 

8, Work the preceding problem by applying (13-13). 

>25 

4. Apply formulas (13-6) and (13-7) to compute / H dC from the data of Prob. 4, 

9 - ^ ^_ 

6. Find approximate values of j V 4 4- x® da; by applying formulas (13-5) and (13-7) 
with Xm •» tn, m « 0, 1, 2, ..., 6. 


14. Euler’s Polygonal Curves. The methods available for the exact 
solution of differential equations, as we noted in Chap. 1, apply only to a 
few, principally linear, types of differential equations. Many equations 
arising in applications are not solvable by such methods, and one is obliged 
to devise techniques for the determination of approximate solutions. 

We begin with the consideration of the first-order equation 

y' = fM (14-1) 


and seek its solution y = y{x) taking on a prescribed value yo = yixo) at 

X - Xo^ 

At each point of the region where /(x,y) is continuous Eq. (14-1) deter¬ 
mines the slope of the integral curve passing through tlmt point. The 
equation of the tangent line at the point {xQ,yo) to the integral cun^e 
y = y{x) is 

y - Vo f{xo,yo)i^ - (14-2) 


If we advance along this line a short distance to a point (xi,yi), we can 
compute from (14-1) the value y'{xi) — f{xi,yi) which, in general, will 
not be cq\ial to the slope of y = y(x) at x = Xj, because the point (xi,yi) 
ordinarily will not lie on the integral curve y — y{x). But if (xi,^i) is 
close to (xo,2/o), the slope of the integral curve at x = xi will not differ 
much from /(xi, 2 /i). To put it differently, the linear function (14-2) 
approximates the solution of (14-1) in the neighborhood of the point 
(xo,t/o)* (See Fig. 1 in Chap. 1, Sec. 1.) 

We consider next the straight line through (xi^yi) with the slope f(xi^yi) 
and proceed along it a small distance to a point (x 2 , 2 / 2 ). At (x 2 ,yz) wq 
draw another straight line with the slope /(x 2 ,t/ 2 ) a,nd advance along it to 
a point (x 3 , 2 / 3 ). By continuing this construction we obtain a polygon 
consisting of short straight-line segments joining the points (xo,2/o)> 

(^ 2 , 2 / 2 ), --v (^n,yn)- The polygonal curve so obtained is called Euler^s 
polygon. This polygonal curve can be expected to approximate the in¬ 
tegral cur\^e reasonably well when the points (xt,^*) are not too far apart 
and the end point {x^din) is not too far away from (^o^^o)- 

The end points of the segments forming Euler's polygon clearly satisfy 
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[Cf, (14*.2)] 

y% - Vo ^ fi^oyyo)ixi - ^o) 

y 2 -yi^ f{^uy \){^2 - ^x) ( 14 - 3 ) 


yn - yn-l = fiXn^uVn^lWn - Xn^i) 
aad if each interval x* — Xt^i is of length we can write (14*^) as 

Vm+I ^ Vm +f{^m,ym)hy m « 0, 1, 2, ..., n - L (14-4) 

The recursion formula (14-4) enables us to compute successively the 
approximate values of the ordinates of the integral curve y - y{x) at 
Xk — Xo + where A: = 1, 2, ..., n. It may suffice for rough calculations 
if the spacing interval h is small and m not too large. 

A more accurate formula can be obtained by constructing, instead ^f 
the chain of rectilinear segments, a chain made up of parabolic segments. 
Thus, we can draw through (xi^yi) a parabola 

y *= oo + Ui(a: — xi) 4* a2(x — Xi)^ (14-5) 

which at X Xo has the slope /(xo,2/o) and at x « Xj the slope /(xi, 2 /i). 
A simple calculation of the constants in (14-5) yields 

y 2 ^ yi+ {y'M + Hly'M - y'Mllh. (i4-6) 

This formula serves to determine ,V 2 if yi ==* y(xi)) y'(xi), and the difference 
Vyi s y'(xi) — y'(xo) are known. Now, if we suppose that the solution 
y(x) can be represented by Taylor's formula 

y(i) = Vo + - xo) + H2 /"(^o)(x - xo)* - h R„, (14-7) 

we can calculate the needed quantities in the right-hand member of (14-6). 

The coefficients in (14-7) can be calculated from (14-1) whenever/(x,y) 
has a sufficient number of partial derivatives, for on setting (xo,yo) in 
(14-1), we get y'(xo) =“/(xo,yo)- Differentiating (14-1) with respect to 
X yields' 

y"(x) « fx{x,y) + /v(x,y)y'(x), (14-8) 

and substituting x «« xo, y = yo ict (14-8) gives 

y"(xo) «/*(xo,yo) +/v(xo,yo)y'(xo). 

By differentiating (14-8), we obtain y"'(x), and so on. The value of Rn 
in (14-7) in general cannot be computed, but by neglecting it we get an 
approximate value of y(x). 

Once the coefficients in (14-7) are determined, we use (14-7) to compute 
* We use the subecript notation for partial derivatives introduced in Sec. 2, Chap. 3* 
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y(Ti) yi. The value of y\xi) is then determined by (14-1), since y'(Ti) 
** The substitution in (14-6) then yields y^. 

Having computed ^ 2 , we can advance another step and compute 
from [cf. (14-6)] 

»3 = ^2 + {y'{x2) + Hbf'ixz) - y'ixi)]}h. 

This requires calculating y'(x 2 ) from (14-1). 

The general recursion formula, based on the parabolic approximation, is 

2/m+l = 2/m + [y'(Xm) + M W(^m)]h, (14-9) 

where Vy'(xJ « y\xn,) - 2/'(x,n-i)- 
More elaborate recursion formulas can be constructed by using poly¬ 
nomials of higher degree instead of (14-5). Such formulas lie at the basis 
of the Adams method of integration of differential equations discussed in 
the next section. 


PROBLEMS 

1 . Construct a polygonal approjcimation, in the interval ( — 1,1), to the solution of 
y* «■ yixy which is such that y(0) « 1. Take the spacing interval h » 0.2. Also obtain 
the exact solution, and plot it on the same sheet of paper. 

2. Determine the coefficients in (14-5), and thus deduce (14-6). 

S. Use the equation in Prob. I to illustrate the calculation of from formula (14-6). 
Also obtain y*. Take xo ** 0, j/o ** 1> * 0.2, xj « 0.4. 

16, The Adams Method. We extend the considerations of the preceding 
section by developing a step-by-step procedure for computing an approxi¬ 
mate solution of 

y' == fi^^y) (15-1) 

taking on a prescribed value t/o at x = xq. The ordinates approximat¬ 
ing the ordinates of the integral curve y = y{x) at x = x^, will be de¬ 
termined for equally spaced values of x, so that == xo + hm, where 
m = 0, 1, 2,_ Thus our approximate solution will appiear in a tabu¬ 

lated form for a discrete set of values of x. 

By the Fundamental Theorem of Integral Calculus, 

f^' y'ix) dx^f (^) dx = y(xm + h) - y(xn) 

Jxm \dz/ 

BO that Wm+i = ym + f y'{x) dx, (16-2) 

where w y{x„ + h) and y„ a yixn). 

Now, if the variable x in the integral of (15-2) is replaced by 

I - -f- hX (15-3) 
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v^uere X is a new dimensionleas variable, (16-2) beccones ^ 

Vm+i ~y„ + h y'{x,n + hX) dX. (16-4) 


But we saw in Sec. 8 that when a function y'{x) is approximated by a 
polynomial of degree n taking on the values y'^, y'm-i, ..2/m-n at a; = x^, 
x„_n, then [cf. (8-7)] 


, , X(X -1- 1) , . 

ir'(aw-bhX) - + - 


+ 


2! 

XiX -hi) ... (X -f n - 1) 


(15-6) 


If we insert (15-5) in (15-4) and carrj^ out simple integrations, we find that 
y»+i = V« + h{y„ + y2Vy’^ + Vii 

^■■■■+anVS„.), (15-0) 


where 


an ^ j 

Jo 


1 X(X + 1) .,. (X + n - 1) 


n! 


dX, 


(K5-7) 


Formula (15-6) enables us to compute the ordinat/O ym+i if we know 
ymy ym) the backward differences When the Vy^ vanish, (15-0) 

reduces to (14-4), and when the V^y'^ vanish, we get (14-9). As was the 
case with (14-9), the values of ymj y'm and the V^yh in (15-0) are not avail¬ 
able to us at the start. They must be computed i)y some means before 
(15-6) can be used to evaluate The number of the depends on 

the degree n of the polynomial chosen to approximate y'{x). Once we 
agree on the value of n, we can compute y,n, y'm and the requisite numl^er 
of the Avith the aid of Taylor^s representation of the solution y = y(.r), 
as was done in Sec. 14. 

We illustrate the procedure in detail in the following example. 

Example: Use Adams' method to obtain, in the interval (0,1), an ap¬ 
proximate solution of 

y'^ y + X, (15-8) 

taking on the value yo — I at x — 0. 

Let us subdivide the interval (0,1) into subintervals of length /i « 0.1, 
so that 

Xjc *= xq *4“ hh 0.1/c, ifc = 0, 1, 2, ..,, 10. 

Furthermore, let us agree to retain in (15-6) the differences of y'm up to and 
including those of order 3. This corresponds to approximating y\x) in 
(15-2) by a polynomial of degree 3. 


^ By (IM) dx^ h dX, and at the limits x ^ Xm and af « a:w -f A, the values of X 
are X 0 and X » 1. 
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To compute from (15-6) we need 2 /«, The 

culciilation of the third differences requires at least four values 

Vtni Vm^h 2 /'m -25 Vm^Zj ^ is obvious from the following table. 


Vm~~Z 

Wm-2 

V*l/m-l 



vV- 

J/m-1 

V2/m 


Vm 



If we determine t/o, zyl, z/ 2 , z/ 3 , we shall be in a position to fill in the values 
in this table with vi - 3 and then proceed to determine 2/4 from (15-C). 
Since ?/o = 1 for X{) = 0 , Eq. (15-8) yields 

Vo - 1. (15-9) 

To compute yi, 2 / 0 , and 2/3 we use Taylor’s series 

n tH 

y{^) = yo + y'oix - j-o) + ^ ~ 


\\ith ijq — 1 and Jo d. The coefficients in (15-10) can be calculaujd 
from (15-8). Differentiating (15-8), we get 

2 /''W == y'M + 1, (15-11) 

and on setting x ~ 0 and rei'aHing that y'{0) - y'o = 1, we get y"(0) =« 2 . 
Successive differentiations of (15-11) give 

y"'(x) = y"(x), y'\x) = y'''{x), 

(15-12) 


and since y”i0) — 2, we get from (15-12) 
y"'i0) = 2 , y"'{0) - 2 , 

Accordingly, (15-10) becomes 


a:® X* T® 

2 _1_I_I_ 


y{x) ==l+x-l-x^-| -b 

3 3-4 3-4-5 

Setting X = 0 . 1 , we get 

, (0.1)® (0.1)^ (0.1)® 

2/1 - 1 -b 0.1 + (0.1)® + ^ ^ ^ + 

3 3-4 3‘4-5 

In the same way using x = 0.2 and x = 0.3, we obtain 

1/3 « 1 . 2428 , 2/3 1 . 3997 . 


2 /<"H0) = 2. 
+ •••. 


1.1103. 
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The deeired values of yu yii ai^d yz 
W© find that 


{CHAF. 9 

can now be computed from (15^). 


y[^yi+xi^ L1103 + 0,1 «« 1.2103 
^2 =® ^2 + ^2 = 1.2428 + 0.2 = 1.4428 
y^^yz + xs^ 1.3997 + 0,3 « 1.6997. 

We can now proceed to construct the table of difiterences shown below. 


X 

V 

y' 

w 

vV 

V*y' 

0 

1.0000 

1.0000 







0.2103 



0.1 

1.1103 

1.2103 


0.0222 





0.2325 


! 0.0022 

0.2 

1.2428 

1,4428 


0.0244 





0.2569 


1 0 0026 

0.3 

1 3997 

1.6997 


0.0270 





0.2839 



0.4 

1.5836 

1.9836 




0.5 

1.7974 






The substitution from this table in (15-6), with m = 3 and n = 3, yields 
1/4 « 1.3997 -f- 0.1[L6997 + H(0.2569) + 5.^2(0.0244) + ^(0.0022)] 

« 1.5836. 

This value is recorded in the table for x = 0.4. 

To compute yn we must extend the table, since formula (15-6) requires 
the knowledge of yi and assorted differences of yi. By (15-8) 

yi = y4 + .T4 - 1.5836 + 0.4 = 1.9836. 

The calculated values (recorded below the heavy line in the table) can 
now be used in (15-6), with w = 4, n = 3, to compute y^. We have 

ys » 1.5836 + 0.1[1.9836 + 3^(0.2839) + ^2(0.0270) + ^(0.0026)] 

- 1.7974. 

This value is recorded in the table for x = 0.5. 

We leave it to the reader to make further extensions in the table re- 
qxiired for the calculation of y^, yy, ..., yio. 


PROBLEMS 

1, Complete the table in the Example of Sec. 15 by computing y$y Vit •.yio. 

8. Since (158) is a linear equation, ii» solution satisfying the condition y(0) 1 
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i» easily found to be y * 2e* a; — 1. Compare the exact and the approximate values 
Vh Vii • • • ? yio- 

3. Apply th(5 Adams method to obtain an approximate solution of y* ^ y with y(0) 
«= 1. Use h » 0,1, and compute y(0.3), y(0.4), y(0.5), and y(0.6) from (15-6) with n «• 2. 
Compare with the exact solution. 

4. Use h « 0,1 and (15-6) with n « 3 to find an approximate value of |/(>-0.6) for 
the integral curve of y' « a;^ -h J/® through (^1,0). 


16, Equations of Higher Order. Systems of Equations. The methods 
of Secs, 14 and 15 can be extended to obtain numerical solutions of equa¬ 
tions of higher order. Thus, the second-order equation 

v" = fix,y,v’) (16-1) 

with initial conditions 

y(xo) = 2 / 0 , y'ixo) = y'o (16-2) 

can be written as a system of two equations of first order by setting 

y' = 2 . (16-3) 

The substitution in (16-1) from (16-3) then yields the second equation 

2 '=/(x,y, 2 ). (16-4) 


In indicating the extension we shall consider, instead of the system (16-3) 
and (l(>-4), a more general system 

y' = f^{x,y,z), 


2 ' = f2ix,y,z), 


(16-5) 


with initial conditions 

y{xo) = yo, 2 (xo) = 2 o- (16-6) 

When solutions of the system (16-5) can be expanded in Taylor’s series 


yix) =« 2 /(a:o) + y'(xo){x - xq) -f {x - Xo)* + • 

^ 1 

2(X) = 2(xo) + z'{Xo){x - Xo) -h —^ (® - ^o)^ + ’ 


(16-7) 


the coeflScients in (16-7) can be computed by differentiating Eqs. (16-5) 
successively as was done ^ in Secs, 14 and 15. 

The construction of Euler’s polygonal approximation also follows the 
pattern of Sec. 14. Thus, the equation of the straight line through (xq, 3 / 0 , 20 ) 
tangent to the integral curve of the system (16-5) is * 


2/ - 2/0 == /i(a:o,2/o,2o)(x - Xo), 
2 - 2o = 72 (^ 0 . 2 / 0 , 20 )(x - Xo). 


( 16 -«) 


' See in this connection Sec. C, Chap. 3. 

*The integral curve of the system (16-5) is, in general, a space curve, so that the 
tangent line to it is determined by the intersection of the planes (16-8). 



7^ NUMBBICAti ANALYSIS (CHAP. 9 

When abscissas are spaced uniformly h units apart, 

ici «« Xo + ^ X2 =* xo + 2/i, ..., Xjfe ** Xo + hh, 
and from (16-8) it follows that the approximate solutions at xi, x%, *.. are 
Vi “ Vo +/i(xo,yoj2'o)^, 

Zl ^ 2 o +/2(3:o;yo,2o)^, 
ya *= y\ +/i(xi,yi, 2 i)A, 

22 = 2l + f2i^\,yu^x)K 


Vh^i = Vk + h{^'k,yhZk)K 
2*4-1 = + hi^k,yk,Zk)h. 


If, instead of approximating the solution in each interval by a linear func¬ 
tion, we make use of the polynomial approximations in the manner of 
Sec. 15, we obtain 

Vm+l = Vm + %m + M Vj/L + Hi ^ On 'Ty'^] 

, , o , , (16-9) 

2m+l = 2m + H + Hi ^-h ®b 


with On determined by (15-7). 

In computing ymi^i and 2,^41 from (16-9), we must first obtain the values 
of ym$ 2to, ymf z'm and the required differences, as was done in Sec. 15. 


Example: Obtain the solution of the system 

- a: -f « 


« 1 + y 


(16-10) 


in the form (16-7), which is such that 

1 /( 0 ) - ~ 1 , z{0) • 1 . 

On Betting xo *• 0 in (16-7) we get 

»(i) “ yiO) + y'(Q)x -h ^ »"(0)i* -t- • • • 

e(i) - *(0) + *'(0)1 -f ~ *"(0)i® -)-, 


(16-11) 


(16-12) 


the coefficients in which can be computed by differentmting (16-10) and noting (16-11). 
We obtain from (16-10) 


y"(x) « 1 -1- *'(*), 

*"( 2 ) - y'(x) 

j,"'(x) - *"(x), 

1 


*(")(!) - y<—0(*). 


(16-13) 
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The eubstitutioiifroiA (16-11) in (16-10) yields |/'(0) 1, «'(0) *1-1 «• 0, and making 

use of these values in (16-13) we find 

y"m - 1 -{- 0 - 1 , s"( 0 ) « y^( 0 ) - 1 , 

y'"(0) « *^'(0) « 1, « y"(fl) « I. 

y^^\0) « 1 , «<»‘H 0 ) » 1 . 


Accordingly, (16-12) yields 

35® 

y(i) --l+x + ~ + -4-*-- 

X® X® 

2 (x) «ri4-0x4*^4*^H-. 


(16-14) 


By eliminating z from the system (16-10) we see that it is equivalent to the second- 
order equation 

y" -y ^2 

with 1 /( 0 ) e* —1, i/'(0) « 1. Its solution is readily found to be 

2, (16-15) 

and from the first of Eqs. (16-10) we conclude that 

z « e* - X. (16-16) 

The Maclaurin expansions of these solutions are precisely (16-14). 

It may l)e instructive to compute the polygonal aj>pioximations to the solution of 
(16-10) at X «* 0.2 and x « 0.4 
On setting the diff<‘reiices in (16-0) equal to zero, we get 

Vm+l I/m *i“ 2,n "4" 

Now, if we take Xi ** 0.2, so that h =« 0.2, wc obtain from (16-17) 

I/I « y(0,2) « -1 -f (0.2)1 *» -0.8, 

zi « z(0.2) « I -f (0.2)0 « 1, 

since j/o *• 1 and zo 0* 

The exact solution (16-15) and (16-16) yields 

yiO.2) « - 2 « -0.7786, 

z(0.2) » - 0.2 « 1.0214. 

Using vi »» —0.8, Zi =* 1 in (16-17), we obtain 

yi * |/(0.4) « 1/1 4- 0,2yl 

Z2 « 2(0.4) « 2i -f 0.2z^ 

The values of y'l and z[ can be calculated from (16-10) by setting z « 0.2, z » 1, and 
y « —0.8. We find that 

j/i ~ 0.2 + 1 » 1.2, 2i'« 1 +1/1 « 1 -0.8 « 0.2, 


(16-17) 


(16-18) 
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« -0.8 -f (0.2)C1.2) » -0.56, 

^2 «» 1 “f (0.2)(0.2) «• 1.04, 
while the corresponding exact values are 

j/(0.4) » - 2 « -0 5082, 

2(0.4) « c®--* - 0.4 1.0918. 

The reader is advised to obtain more accurate polygonal approximations by taking 
the interval A « 0.1 and to compare the |H>lygonal approximations with the values given 
by (16-9) in which the differences of order higher than 1 are set equal to zero. 


PROBLEMS 

1 . Obtain from (16-7) a fourth-degree polynomial approximation to the solution of 

y' sa g* -f z* «« e~* -f y 

with y(0) « 0 and z(0) « 0. 

2. Use a polygonal approximation to compute yi, y% yz, 1/4 for the system in Prob. 1 
by taking xi 0.1, is « 0.2, xz — 0.3, 14 =* 0.4 

8. Use a polygonal approximation to compute yi, y% 2/3, corresponding to xi « 0 1, 
X 2 0.2, xg ** 0.3 for y” — j/^ » x, wdth initial conditions j/(0) =* 1, y’{Qi) » 0. Hint 
Set y' ** z, and consider the system j/' *= z, z' «« x -f with 2 /( 0 ) « 1 and z(0) =» 0. 

4, Obtain the solution for Prob 3 in Maclaurin’s series. 

5. Solve the system in Prob. 3 by the Adams method. Retain only the second dif¬ 
ferences in (16-9), and use the result of Prob. 4 to start the iteration. 

17. Boundary-value Problems. In many physical problems solutions 
of the second- and higher-order differential equations are required which 
satisfy preassigned conditions at more than one point of the interval. A 
simple example of this occurs in the study of deflections of a beam supported 
at several points. Problems of this sort are termed boundary-value prob- 
lenis to distinguish them from imtial-value problems in which the conditions 
on solutions are imposed only at one point. 

An important feature of the boundary-value problems is that their 
solutions (if they exist at all) need not be unique.^ When the general 
solution of the differential equation can be obtained, the conditions im¬ 
posed on solutions of the boundary-value problem can usually be met by 
determining the values of arbitrary constants in the general solution* 
so that the specified conditions are satisfied. How^ever, general solutions 
of differential equations can rarely be written down, and one is obliged to 
seek solutions of boundary-value problems by numerical methods. The 

* See, for example, our discussion of two interesting two-point boundary-value prob¬ 
lems in Sec. 34, Chap. I. 

*This was the procedure followed in solving the boundary-value problems in Sec. 
34, Chap. 1. 
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methods available for numerical solution of initial-value problems require 
that the integral curve be uniquely determined at the starting point and 
thus do not apply to problems in which solutions must satisfy specified 
conditions at more than one point. To kSoIvc a boundary-value problem 
numerically one must employ laborious trial-and-error procedures utilizing 
the solutions of suitable initial-value problems. 

We outline briefly the procedure commonly followed in solving a two- 
point boundary-value problem for the second-order difTercntial equation.* 

Let it be required to determine a solution of 

which assumes at the end points of the interval a < x < b the values 

y(a) = A, tj{b) - /L (17-2) 

Now', if in addition to the value y{a) = A we specify the slope ]/(a) 
at j = a, the solution of (17-1) is uniquely determined,^ but this solution 
will satisfy <he condition y(b) = B only for some value of the slope y'ia) 
\vhi(‘li is not knowm.^ Physical or geometric considerations may suggest 
an approximate value of the slope, say y'(a) == C, which is such that the 
integral curve of (17-i) satisfying the conditions 

y(a) = A, y\a) = C (17-3) 

also satisfies tlie condition y{b) = B. 

The procedure used in solving the boundary-value problem consists in 
actually constructing the solution y = y{x) satisfying the conditions (17-3) 
and computing the value oi y{x) at x == 6. If it is tolerably near B, we 
have the desired approximate solution of the boundary-value problem. 
If not, we choose another value of the slope y'ia) and try again. The 
procedure is c'learly laborious and far from being elegant. 

18. Characteristic-value Problems. Closely associated with boundary- 
value problems are charaxteristic-value problems. These are generally 
concerned with solutions of the two-point boundary-value problems for 
differential equations containing parameters. 

A simple instance of the characteristic-value problem occurs in the study 
of small vibrations of an elastic string of finite length.'* When initial shape 
and initial velocity of the string are specified, its subsequent displacement 

* For a more detailed discussion of such problems see W. E. Milne, ‘^Numerical 
Solutions of Differential Equations,” chap. 7, John Wiley & Sons, Inc., Now York, 
1962. 

* We suppose that/(x,y,i/') is such that the initial-value problem has a unique solution. 

*We assume that the boundary-value problem in (17-1) and (17-2) indeed has a 

solution. 

^ See Chap. 6. 
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( 18 - 1 ) 


are fixed at x 
ocmditions 


m(x, 0 is detennioed by solving the equation 

d®«x d*« 

a? a?' 

where a is a physical constant. If the string is of length I and its ends 
0 and a; = 1, the solution of (18-1) must satisfy the end 

m(0,<) = 0, u(l,i) = 0. (18-2) 

When we attempt to obtain solutions of (18-1) by the method of separa¬ 

tion of variables,* that is, by assuming that u{x,i) is expressible in the form 

u{x,t) = y{x)T(t), (18-3) 

where y(®) is a function of x alone and T{t) is a function of i alone, we are 
led to a pair of ordinary differential equations 

+ X*3/ = 0, (18-4) 




0 , 


where X is a constant. This constant must be chosen so that the end 
conditions (18-2) are satisfied. 

From the assumed form of solution (18-3) and from (18-2) it follows 
that the solutions of (18-4) must be such that 


2/(0) = 0, y{l) « 0. (18-5) 

We thus have a two-point boundary-value problem for Eq. (18-4) with the 
end conditions (18-5). 

The determination of suitable solutions this time is very simple because 
the general solution of (18-4) is 

j/ = Cl cos Xx + C 2 sin Xx. (18-6) 

If we impose the conditions (18-5) on (18-6) and reject the trivial solution 
y » 0, we find infinitely many solutions 

2/«C2 8inXx, (lg«7) 

kv 

where X = (18-8) 

I 

The values of X in (18-8) are called the characteristic values of the boundary- 
value problem of ^8-4) and (18-5), and the solutions (18-7) with appro¬ 
priate Xs are characteristic functions of this problem.* 

‘ See Sec. 10, Chap. 6, 

• The tenuB eigermslm and mgmfuincUm are used by some writera to mean **ohareo- 
temtic value'* and “characteristic function," respectively. These stem from Gennan 
words Migemmt and Eigef^unkt/im, We eschew the hybrids, sinoe this book is written 
in £iiigUsh. 
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Tlie simplicity erf the characteristic-value problem defined by (18-4) 
and (18-5) masks some important features of the general problem- These 
features become clearer if we consider the determination of small vibrations 
of an elastic rod of variable cross section. In this case the separation of 
variables in the appropriate partial differential equation leads to the equa¬ 
tion 

^ I^P(^) - ^7(ic)y * 0, (18-9) 

in which p(x) and q(x) are known functions and X an unknown constant. 
If the rod is of length Z, with the end points at a: ~ 0 and x =« Z, the solu¬ 
tions of (18-9) must satisfy suitable conditions determined by the mode 
of fixing the ends. If the end x = 0 is clamped, then p(0) — ^'(0) *= 0; 
if it is simply supported, then y(0) — y"(0) == 0; if it is free, then y"(0) =« 
y'"(0) « 0. Similar conditions arc imposed at the end x ~ Z. 

For definiteness, we suppose that the ends of the rod are free (a ship 
floating at sea). We then seek a solution of (18-9) such that 

2/"(0) - y"'(0) - 0, y"(l) *= 2/'"(Z) = 0. (18-10) 

Since (18-9) is a linear equation, its general solution is the sum of four 
linearly independent solutions 

j/(x,X) Ciyi(x,\) + C 2 j/ 2 (x,X) + + 04 ^ 4 (x,X) (18-1 i) 

where X is the parameter appearing in (18--9) and the c^ are arbitrary con¬ 
stants. On imposing the end conditions (18-10) on (18-11) we get a sys¬ 
tem of four equations: 

cmi0,\) + C2y2(0,X) 4* C32/3(0,X) + c^yi{0,\) = 0, 

CiViiO.X) + C 21/2 (0,X) + C3y7{0,\) + C4yi\0,\) = 0, 

CiViiW + + csylm + cmiW = 0 , 

cmiW + C2y2{l,\) + C3yl\l,X) + c^ylil.X) 0 . 

This system of four linear equations in the unknowns c, will have a non¬ 
trivial solution if, and only if, the determinant D{X) of the coefficients of 
the 08 is zero.^ The equation 

D{X) = 0 (18-12) 

is the characteristic equaiioUj and its solutions are the characteristic values 
of the problem. In general (18-12) is a transcendental equation, and its 
solution poses many vexing problems.^ Usually it is solved by numerical 

‘ See Appendix A. 

*An instance of a simple transcendental characteristic equation appears in Sec. 10, 
Chap. 6, Eq. (10-16), in which the parameter is denoted by i5. See also Sec. 36, Chap. 1, 
Eq. (36-4), where D(X) 0 i» an algebraic equation. 
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meihode. Because of the importance of characteristic equations in ana¬ 
lyzing the behavior of dynamical systems, they have been studied ex¬ 
tensively and there is a vast literature on the subject of numerical deter¬ 
mination of characteristic values.^ 

19. Method of Finite Differences, We conclude this chapter with a 
brief description of the most commonly used method for solving boundary- 
value problems in partial differential equations, known as the method of 
finite differences. In this method the differential equation is replaced by 
an approximating difference equation, and the continuous region in which 
the solution is desired by a set of discrete points. This permits one to 
reduce the problem to the solution of a system of algebraic equations, 
which may involve hundreds of unknowns. Ordinarily, some iterative 
technique has to be devised to solve such systems, and high-speed ekn*- 
tronic computers have been developed largely because of the need for 
coping with problems of this sort. 

The main disadvantage of all numerical techniques is that they give 
numerical values for unknown functions at a set of discrete points instead 
of the analytic expressions defined over the initial region R. Of course, 
when the boundary-value data are determined by measurements at a 
finite set of points of R, the difference-equations methods may be the best 
mode of attack on the problem. Any analytic technique would require 
fitting curves to the discontinuous data. 

We proceed to the outline of the general procedure followed in reducing 
the given analytic boundary-value problem to a problem in differen(*e e(|ua- 
tions. For definitene&s let the region R be bounded by a simple closed curve 
C. We seek to determine the function satisfying a given differential 

equation in i?. From the definition of partial derivatives it follows that ^ 

du u(x + h, p) — u(x,y) 

— =; lim- 

dx h 

Also, if the second partial derivatives are continuous one can show that 

d\ u(x + h,y) - 2u{x,y) + u(x - h, y) 

= hm-- 

dx^ h~*o h 

d^u u{x + h, y + k) — u(x + h, y) - u{x, y + k) + u{x,y) 

—— « hm-* 

dx dy k-*o hk 

and so on. 

For small values of h and k the partial derivatives are nearly equal to 

‘ For bibliography see Milne, op. eU., and F. B. Hildebrand, '‘Introduction to Nu¬ 
merical Analysis,” McGraw-Hill Book Company, Inc., New York, 1956. 

*See Chap. 3, and Chap. 6, Sec. 21. 
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the difference quotients appearing in the right-hand members of these 
formulas. If one replaces derivatives in the given differential equation 
by difference quotients, there results a difference equation which is a good 
approximation to the given equation when h and k are small. 

Thus, to Laplace’s equation 


V^u 


d^u d^u 

—- -1 -- = 0 , 

dx^ dy^ 


there corresponds the difference equation 


— 0 , 

where 

^ "2 [u{x + h,y) - 2u{x,y) + u{x - A, y)], 
1 

s — fw(^, y + h) — 2u{x,y) + u{t, y - h)]. 


In a differen(‘e equation the values of u(x,y) are related at a set of discrete 
points determined by the choices 
of h and k. Ordinarily these 
points are chosen so that they form 
a square net ^ with specified mesh 
size h. 

The usual procedure is to cover 
the region R by a net ermsisting of 
two sets of mutually orthogonal 
lines a distance h apart (Fig. 1-4) 
and mark off a polygonal contour 
(^' so that it approximates suffi- 
(‘iently closely the boundary (\ 

The domain R' in which the solu¬ 
tion of the difference equation is 
sought is formed by the lattice 
points of the net contained within 
r'. The assigned boundary values 

on C are then transferred in some manner to the lattice points on C\ 
When the lattice points on T' do not coincide with points on C, the 
desired values can be got by interpolation.^ 



^ liectangular, polygonal, and curvilinear nets aie also used. See, for example, 
D. Y. Panov, "Handbook on Numerical Solution of Partial Differential Equations,” 
Mohoow, 1051, which contains a good account of the difTen‘nce-equation8 techniques. 
8ee also Appendix to 8. Timoshenko and J. N. Cloodier's "Theory of Elasticity,” 1051. 

*See, for example, Milne, op. ciL, or L. M. Milne-Thomson, "Calculus of Finite 
Differences,” 193E. 
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One then seeks e solution of the difference equation which satisfies the 
boundary conditions imposed at the lattice points on C'. Usually, this 
leads to a consideration of a system of a large number of algebraic equations 
in many unknowns*^ 

^ Further discussion of difference equations is given in Chap. 6, Secs. 26 and 27, and 
in Chap, 8, Sec. 10. See also chap. 10 by T. J. Higgins in L. E. Grinter (ed.), “Numerical 
Methods of Analysis in Engineering/' 1949. 

The literature on finite-difference methods is extensive. An illustration of the use 
of the method of finite differences in solving a boundary-value problem in Laplace's 
equation is included in I. S. Sokolnikoff, “Mathematical Theory of Elasticity/' sec. 124, 
MoGraw-Hill Book Company, Inc., New York, 1956, which contains further references. 
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DETERMINANTS 


1. The Definition and Properties of Determinants. A determinant of 
the first order consists of a single element a and has the value a. A de¬ 
terminant of the second order contains four elements in a 2-by-2 square 
array and has the value 


di 02 
bi 


= (iih2 — U 261 . 


( 1 - 1 ) 


A determinant of third order is similarly defined, in terms of second-order 
determinants : 


dl Q>2 CLq 

bi 62 ^3 

Cl C2 C 3 



h 

bz 


bi 

bz 

+ Cl'S 

bi 

&2 

= ai 

i 

C2 

Cz 1 

~ 02 

Cl 

Cz 

Cl 

C2 

1 


(1-2) 


By analogy, a determinant of order n consists of a square n-by-n array 
of elements : 


an 

ai2 • 

• ain 

<^2l 

a22 * 

• «2n 

am 

a„2 * 

■ • O-nn 


to which a numerical value is assigned as follows: Denoting the deter¬ 
minant by Z), let the elements in the first row be au, and let Mu be tlie 
determinant of order n — 1 formed when the first row and fth column of 
D are deleted. Then, by definition, 

-0 = aiiMn - 012^12 -[-••• + ( 1 - 3 ) 

The definition is inductive; a determinant of order n is defined in terms 
of those having order ri — 1. 

The expansion (1-3) is termed a Laplace development of the determinant 
on elements of the first row. The determinant Mu is called the minor 
of the element au; the signed determinant is the cofactor of 

ait% More generally, the determinant Mij formed when the fth row and 

741 
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jth column are deleted is the minor of the element a*,* in this row and 
column. The signed determinant 

^ ( 1 - 4 ) 


is the cofactor of It is a fundamental theorem that a determinant may 
he mduated by a Laplace development on any row or column; in other words, 


flu 

Ul2 * 

* Uln 



^21 

U22 • 

• U2n 

n 

a>ijAij = 

i^X 


Uni 

Un2 • 

* * Unn 




The proof may be given by induction directly or may be based on the following 
considerations, which are also established by induction. The expansion of an nth-order 
determinant is a sum of the nl terms ( — . .a^^n, where A^i, ^ 2 , ..are the 

numbers 1,2, ..., n in some order. The integer k is defined as the number of tnverBtmia 
of order of the subscripts ki, k^, .. kn from the normal order 1, 2, ..., n where a par¬ 
ticular arrangement is said to have k inversions of order if it is necessary to make k suc- 
cesave interchanges of adjacent elements in order to make the arrangement assume the 
normal order. There are terras, smce there are n! permutations of the n first suIh 
B cripts, and each term contains as a factor one, and only one, element from each row 
and one, and only one, element from each column. 

For example, consider the third-order determinant 


On 

012 

018 

021 

022 

028 

031 

032 

088 


The six terras of the expansion are, apart from sign, 

ailfl22Gt33, 0ua32023, a2l«12038, <^ia82ni3, 031012028, 081022018. 

The first term, in which the first subscripts have the normal order, Is called the diagonal 
term, and its sign is positive. In the second term the arrangement 132 requires the 
interchange of 2 and 3 to make it assume the normal order; therefore k ^ I, and the 
term has a negative sign. Similarly, the tliird term has a negative sign. The fourth 
term has a positive sign, for the arrangement 231 requires the interchange of 3 and 1 
followed by tlie intercliange of 2 and 1 to assume the normal order. Similarly, the 
fifth term has a positive sign. In the sixth term, it is necessary to make three inter¬ 
changes (3 and 2, 3 and 1, and 2 and 1) in order to arrive at the normal order; hence, 
this term will have a negative sign. It follows that 

D « 0ll022®S8 «ll«82fl23 *“ 021®12<^38 "f 021^82018 "f anOvijlliz ““ ^81022^X8. 

The main result of this discussion is that a determinant is the sum of 
all the n\ products which can be formed by taking exactly one element from 
each row and each column and multiplying by 1 or --1 according to a 
definite rule. 
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6 0 
4 7 
2 0 


The ( — 1)*'^^ rule for determining sign means that the sign of the minors alternates as 
we proceed from one element to an adjacent one in the same row or column, and the 
sign starts with -j- in the upper left-hand corner. Thus, 

D « 0(-Mi 2) -f O(Afjs) -f 7(-Jkrw) -f 0(M42). 

Crossing out the row and column containing 7 gives the determinant Af 82 , whence 


D » -7^32 « (-7) 


1 -1 2 

6 4 3 

2 2 3 

(-7)f(l)(0) ~ (-1)02)+2(4)1 


(-7)[(1)| 


4 3 
2 3 


(- 1 ) 


6 3 
2 3 


+ ( 2 ) 


6 4 
2 2 


-182. 


D 


Example 2. I’ho following determinant is said to be in diagonal form. Show that its 
value is aXicA no matter what elements are put in place of the *8: 

a * ♦ 

0 6 * 

0 0c 

0 0 0 d I 

Successive T^aplace developments on first columns give 

b 

I 

» a 


0 c ♦ 
0 0 d 


ah 


c ’ 
0 d 


abed. 


Evidently, a similar result is true in general. 

Example 3 If th(‘ ehmu^nts are differentiable functions of t, show that the derivative 
of the determinant (1-2) is 


f 

/ 

tt2 

t 

as 


fll 02 O3 


Oi 

02 

Os 

^1 

^2 

^3 

+ 

61' bi 63' 

+ 

hi 

hi 

63 

Cl 

C2 

fa 1 

1 1 

Cl C2 Cj 1 



<^2 

t 


A typical term in the expansion is :±atbjCK. Differentiating gives 

±{athjny = AzatbjCk ± UtbjCk ± a^hjci 

and the sum on i, j, k of these three types of terms yields the expanded form of the 
three determinants. A corresponding result for determinants of order n is proved in the 
same way. 

The fundamental theorem (1-5) leads to some important properties of 
determinants that are now enumerated. 

1. If each element in a row or column of a determinant is zero, the deter- 
minani is zero. 
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2, If mch eiemerU in a row or column i» mtMjdied hy the detmninani 
i$ fmdUjjUed by m, 

8. If each dmmt of a row or column is a sum of two termSf the determinant 
equate the sum of the two corresponding determinants; for exampley 


dl 

h 

Cl 


Oi 

h 

Cl 


dl 

h 

Cl 

a 4* « 

b + ft c + 7 

- 

a 

b 

c 

+ 

a 


y 

Oa 


C% 


02 

63 

C2 


02 

b2 

C2 


These three results become obvious when we make a Laplace develop- 
ment on the row or column in question. In (1-6), for example, let A, B, C 
be the cofactors of the elements in the second row. The determinant is 

(a + a)A + {b + 0)B + (c + 7)C, 

which equals {aA + bB + cC) + ( 0*4 + /3J5 + yC), This, in turn, is the 
sum of the expansions of the two determinants on the right of (1-6). 
The proof for n-by-n determinants is very similar and should be supplied 
by the reader. 

4. If two rows or two columns are proporiiondly the determinant is zero, 

5. If two rows or two columns are interchangedy the determinant changes 
sign. 

6. If rows and columns are interchanged^ the determinant is unaltered. 

The properties 4, 5, and 6 are easily verified for 2-by-2 determinants, 
then proved in general by mathematical induction. To obtain item 6, 
for example, expand the original determinant on elements of the first row 
and the new one on elements of the first column. The theorem for order 
n then follows from the theorem for order n — 1. As an illustration we 
have 


dl 

bi 

Cl 

02 

b2 

C2 

dz 

bs 

cz 


^2 ^2 

1>3 ^3 


— 02 


h 

bs 


Cl 

Cb 


+ dz 


bi Cl 

62 C2 


(1-7) 


which coincides with the expansion (1-2) when we interchange rows and 
columns of the second-order determinants on the right-hand side of (1-7). 

7. The value of a determinant is unaltered if a multiple of one row {or 
column) is added to another. 

8. If the cofactors for one row {or column) are combined with the elements 
of anotherj as in (1-8), the resulting sum is zero: 

n n 

dijAik 0 , h ^ j. 

ipml imml 


( 1 - 8 ) 
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These results follow from those already established. To illustrate the 
proof of item 7, we have 

Oi hi + mci Cl ai hi Ci at mci Ci 

02 62 + ^2 C 2 — 02 ^2 ^2 + 02 mc2 C 2 (1-9) 

03 “H ^z ^3 ^3 ^z ^3 ^^3 Cg 

by 3; and the second determinant on the right of (1-9) is zero by 4. The 
reader should extend the proof to n-by-n determinants. 

The result 8 follows from 4. Thus, the first eiqpression (1-8) is the 
expansion of the determinant which arises when the row 

^Ikf ^2kf • * On*; 

is replaced by oiy, a 2 j, ,.., Onk, and hence it is the expansion of a de¬ 
terminant with two equal rows. 

9. If two determinants A and B of order n are given and a new determinant 
C is formedy the element in the iih row and jth column of which is obtained 
by multiplying each element in the iih row of A by the corresponding element 
in the jth row of B and adding the products thus formedy then C ^ AB. 

Thus, if the elements of A and B are denoted by o»j and respectively, 
then the element in the ith row and jfth column of the product det^^f- 
minant C is 

= Otibji + H-b 0»n^;n* (1-10) 

The validity of rule 9 for determinants of order n follows from considera¬ 
tions entirely similar to those we give next for the case when n == 2 . 

If the determinants A and B are of second order, formula (1-10) states 
that their product C is 

Cf'llbll + Ul2^12 <^11^21 + <^12^^22 

<* 21^11 “b ^22^12 <^ 21^21 “b <^ 22^22 

Since the elements in ( 1 - 11 ) are binomials, we can write C by using prop¬ 
erty 3 as the sum of four determinants: 

O 11611 U 11&21 ^ ^* 11^11 < 3^12622 

021611 021621 021611 022622 

012612 011621 ^ 012612 O 12622 

022612 021621 O22612 O22622 

On factoring out the elements On and 021 in the first determinant, we 
obtain a determinant with two like columns, and hence its value is zero. 
Similar remarks apply to the fourth detenninant. The second deter- 
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minant, oa factoring out 6 u h 2 } yields while the third haa 

the value — 6 i 2 ^ 2 i- 4 . Thus 


C « *4 ( 611^22 ^ 12 ^ 21 ) 


« AB. 


Similar, but much more laborious, calculations can be carried out for deter¬ 
minants of higher order to establish the validity of rule 9. 

Since the value of a determinant is unchanged when its rows and columns 
are interchanged, there are four ways in which the determinant C may 
be written. Thus, if we interchange the rows and columns of B, the ele¬ 
ments Cij in C will be given by ( 1 - 10 ) in which the subscripts on the 6 s 
are interchanged.^ 


Example 4. Without expanding show that 


1 n X? 
I xj xl 

1 X3 X 3 


(Xl - X2)(X8 - X2)(Xi - X3). 


The determinant is a polynomial in Ji, and it vanishes when xj » J 2 , since the first two 
rows are then proportional Hence it is divisible by xi — X 2 . Similarly, it is divisible 
by X 8 — X 2 ajnd xi — X 3 . It therefore equals 

E(Xi - X2)(X3 - X2)(Xi - Xz) 


for some polynomial E. Since the determinant is of degree 3 in xi, X 2 , X 3 , we must have 
E w const, and comparing coefficients of X 2 X 3 shows that E I. 

Example 5. Write the product of the determinant;^ 



1 

2 

1 



-1 

4 2 

A 3 = 

3 

0 

1 

and 

B « 

2 

-1 3 


' 0 

2 

1 1 



0 

2 -1 


as a single determinant of third order. 
Using rule 9 we find 


AB 


-1+8+2 2-2+3 0+4-1 
-3+0+2 fi-0+3 0+0-1 
0+8+2 0-2+3 0+4-1 

9 3 3 

-19-1* 

10 1 3 


To check the result, we find on expanding the determinants A, B, and AB that A - 2 , 
B « 21, and AB » —42. 

Example 6 . Show that a trigonometric polynomial 

y «* ai sin X + 02 sin 2x + aj sin 3x 
J Cl Eq. (16-3), Chap. 4. 


( 1 - 12 ) 
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passi&g through three aseigued points {Xx^y^) is given, in general, by 


y sin X sin 2 x sin Sx 

j/i sin x\ sin 2x\ sin 3xi 

2/2 sin X 2 sin 2 x 2 sin 3x2 

2/8 sin xs sin 2 x 3 sin 3x8 


0 . 


Expanding on the first row gives 
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c\y -f c% sin x -f ca sin 2x -f C 4 sin 3x » 0 

where c* are the appropriate cofactors. Hence y has the form (1-12), if c\ 7 ^ 0. More¬ 
over, when X « x» and 2 / « 2/t the equation is true, since two rows of the determinant 
are then equal. 


PROBLEMS 


1. By a Laplace development on the first row, evaluate 


1 2 3 


-12 2 


1 0 0 


1 2 3 

3 1 2 

. 

-3 6 6 


0 0 1 

, 

4 5 6 

2 3 1 


5 7 9 


0 10 


7 8 9 


2. Evaluate the deU)rniinants in Prob. 1 by a Laplace development on (a) the first 
column, (6) the second row. 

3 . Evaluate this determinant by develofuncnt on 


(а) The first column 

(б) The second row 

(c) The first row 

(d) The third column 


1 -1 

0 1 

0 0 

1 0 


1 -1 

-1 1 

1 -1 

0 1 


4 . Show that 


xi 1 
X2 1 


Xi 

yi 

1 

X2 

y 2 

1 

X3 

ys 

1 


represent, respectively, the (signed) length of the segment (xi,X 2 ) and the area of the 
triangle with vertices (x,,!/,). 

6. Evaluate, using some of the properties 1 to 7: 


X 1 1 


y 4- 2 X X 


0 -0 -b 

1 X 1 

I 

y X + r y 


0 

0 

1 

1 1 X 


z z X 4- y 


b c 0 


Hint' In the last determinant, interchange rows and columns. 

6. Write out as determinants of third order the product of the first determinant in 
Prob. 1 by the second and third determinants. 

7. Using determinants, find a, 6, c if y ■« a -f 6 cos x -f c cos 2x passes through (0,0), 
(r/2,1), (ir,-2). 

8. (a) Find a cubic containing the points (0,1), (1,-1), (3,4), (4,0). Hint: Consider 
a determinant with top row y, 1, x, x®, x^. (6) Write the equation of a polynomial of 
degree n whose graph contains n + 1 assigned points (x*,yt). 
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S. Cim&er’fi Ruk. Consider the set of simultaneous equations 

aix + bip + Ciz a= di, 

Of^ + hy + ^ d2t 

Oax + b^y + c^z » da. 

Now, by 2 and 7 of the preceding section, 


(2-1) 



ai 

6 i 

Cl 


aix 

bi 

Cl 


OiX 4- biV + CiZ 

bi 

Cl 

X 

(l2 

bi 

C2 


a^x 

bi 

C2 


OiX + 62 ^ + CiZ 

bi 

C2 


03 

bs 

C2 


dsX 

bi 

cb 


+ bay + csz 

ba 

cz 


Hence if z satisfies (2-1), it is necessary that 



ai 

bi 

Cl 


di 

bi 

Cl 

X 

^2 

^2 

C2 


di 

bi 

C2 



^3 

cz 


da 

ba 

Cb 


(2-2) 


The determinant on the left of (2-2) is termed the coefficient determinant 
of the system (2-1); we denote it by D. Equation (2-2) and the cor¬ 
responding relations for y and z may then be written 


(2-3) 


If D 3 ^ 0, we may divide by D to express x, y, and 2 as quotients of two 
determinants. 


xD == 

di hi Cl 

d'2 ^2 C 2 

• yD = 

Ui di Cj 

^2 d2 C2 

II 

Ui hi d\ 
dz ^2 d2 


dz c^ 


! dz ds C3 


dz hz dz 


To show that these values of a;, y, and * actually satisfy the system (2-1), substitute 
into (2-1) and multiply through by Z>. The equations become 



di 

h 

Cl 

1 

ai 

di 

Cl 


1 Ol 

h 

di 


1 «1 

bi 

Cl 

Uibj 

(k 

h 

Cj 

+ 6jfcj 

02 

d2 

C 2 


02 

62 

<h 


02 

bi 

C 2 


dt 

h 

C» 1 


ai 

di 

Ci 


! Qi 

h 

di 


0 * 

b$ 

ca 


with ■■ 1, 2, or 3 respectively. Now, the determinant 

Ofc h Ck dk 
ai bi d di 

(H h cz dt 
h Ci dd 

»eKi because twp rows are equal, and it yields the desired relation when eitpanded on 
elements of the first row (use Theorem 6 of the preceding section). 
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The foregoing method applies to n equations in n unknowns, and jirields 
Ceameb’s Rule: Lei ^ 


Oll^l + 012^1 + * • • + dlnXn “ 
a^lXi + a22^2 4-f" 0>2nXn *= ^2, 

( 2 - 4 ) 


<2nl^l + Cln2^2 “i -i* (^nn^n 

he a system of n eqvLaiions in the n unknowns Xi 8^lch that the coefficient de¬ 
terminant D is not zero. The system (2-4) has a unique solution Xt = Di/D^ 
where Di is the determinant formed by replacing the elements au^ a 2 t, ..Oti* 
of the ith column of D by kiy fc 2 > • • •, respectively. 

Consider the homogeneous system which arises from (2-4) wdien the 
right-hand members are replaced by zero. This system obviously has a 
solution = a :2 =•••== = 0, the trivial solution. If the coefficient 

determinant is not zero, the solution is unique by Cramer's rule. Hence 
a homogeneous system can have a nontrivial solution only if the coefficient 
determinant is zero. One can prove, conversely, that there is always a 
nontrivial solution of the homogeneous equations if the determinant is 
zero. 

The rectangular array 

( Ui i>i Cl di\ 

02 b2 C2 d2 j (2-5) 

03 h C 3 dff 


is termed the augmented matrix of the system (2-1). By striking out one 
or another column of the matrix (2-5), we are led to the square arrays 




bi 

Ci\ 

02 

^2 



^3 

cj 


Since these arrays are square, they have corresponding determinants. 
Now (2-3) shows that all these determinants must be zero if Z> = 0 and if 
the system (2-1) actually has a solution. In other words, if D = 0 but 
a third-order determinant formed from (2-5) is not zero, then the system 
(2-1) is inconsistent. 

The foregoing results are included in a general theory of linear systems, 
which is now discussed. An m-by-n matrix is a system of mn quantities 
a*;, called elements^ arranged in m rows and n columns. The array is cus- 

^ A compact derivation of this rule is given in Sec. 15, Chap. 4. 
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tomarily enclosed in parentheses, thus: 



Uxi 

0x2 * 

• • a.in \ 

A fiE 

' ^21 

i * * 

^'22 * 

•• «2n 


\^1 

* 

■ ' O'fn.nf 


If m *= n, then A is the coeffijcient matrix of the system (2-4); the augmented 
matrix is obtained by adjoining a column with elements (in order) fci, 
k 2 , * • If the matrix is square, one can form th^ determinant of the 

matrix, a determinant whose elements have the same arrangement as 
those of the matrix. From any matrix, smaller matrices can be formed 
by striking out some of the row^s and columns. Certain of these smaller 
matrices are square, and their determinants are called determinants of the 
matrix.. A matrix A is said to be of rank r if there is at least one r-rowed 
determinant of A that is not zero, whereas all determinants of A of order 
higher than r are zero or nonexistent. (The latter alternative arises if r 
equals the smaller of the two numbers m and n.) The rank is zero if all 
elements are zero. With these preliminaries we can state the following 
Fundamental Theorem: Suppose we are given a set of m linear equations 
in n unknowns. Let the rank of the coefficient matrix he r, and lei the rank of 
the augtnmted matrix he r'. If r' > r, the equations have no solution. If 

^ r ^ n, there is one, and only one, solution. If r' = r < n, we may give 
arbitrary values to n — r of the unknowns and express the others in terms of 
these. 

The proof is too long for inclusion here. Important special cases were 
established, however, by the proof of Cramer^s rule and by the discussion 
of (2-5), Further discussion of matrices is given in Chap. 4. 

The r unknowns which are expressed in terms of the others must be asso¬ 
ciated with some nonvanishing determinant of order r. 


Example 1. By Cramer’s rule, find x and given 

3x 4" y ~h 2* « 3, 

2x 3y — x ** —3, (Z-fi) 

a: -f 2y 4- 2 » 4. 

The coe£5oient determinant D is found to be 8, so that 



3 

1 

2 



3 

3 2 

8x - 

-3 

-3 

-1 

-8. 

8y 

2 

-3 -1 


4 

2 

I 



1 

4 1 


Thus, X * 1, y -■ 2. If 2 is desired, one can find it from the third equation (2-6): 
2*»4—X — 2y»»4 — 1 ——1. 
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Ezampk 2, For what values of X do the equatious 
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a2^ -f hy *f ^2 =* 0. 

Let (ar,y) be the point at whirh the three lines meet. With this particular choice of 
X and y, the three equations (2-12) are satisfied simultaneously. Now, these equations 
may be regarded as simultaneous equations in three unknowns x, y, and 1, one of which 
(namely, 1) is not zero Hence the coefficient determinant vanishes: 

a b c 

ai hi Cl *» 0 . 

ua ^2 C2 



7S2 


ArraNDCK 


[api», a 

(The condition is also eoflicient if no tifro of the three lines are parallel The reader 
should observe the duality between points and lines which is illustrated by this and 
the following example.) 

Example 5. Find a necessary and suficient condition that the three points 
(sc»,ys) He on a Hne, 

If the equation of the line is 

<nc -j“ hy “f* c "■ 0 (2-121) 

we have, besides (2-13), 

0*1 4“ hyi + c » 0, 

4- 4* c w 0. 

These equations may be regarded as a system in the unknowns a, &, c, which cannot 
all vanish if (2-13) represents a line. Hence the coefficient determinant must vanish: 

X y I 

xi yi 1 «0. (2-14) 

X2 yi 1 

Conversely, (2-14) ensures that the system has a nontrivial solution a, h, c. Compare 

Prob, 4, Sec. 1. 

Example 6. Show that the following equations are consistent if, and only if, ifc «* 9: 

2x 4“ 3y « 1, 

* ~ 2y « 4, (2-15) 

4a: — 1 / « fc. 

The coefficient matrix has rank 2, and hence the equations are consistent if, and only if, 
the augmented matrix also has rank 2. This entails 

2 3 1 

1 -2 4 -0, 

4 -1 A; 

which yields 1(7) — 4( —14) 4- A:(—7) * 0 or k « 9. The same result is found if we 
regard (2-15) as a system in the three unknowns x, y, k and solve by Cramer's rule. 
The reader should obtain the result of Examples 8 and 4 by considering the augmented 
matrix, as in the present example. 

PROBLEMS 

X. Solve, by Cramer’s rule, the systems: 

(a) X 4" 2y 4“ 32 3, (5) 2x 4* y 4" 3® 2, 

2x — y -h 2 “ 6, 3x - 2y — 22 » 1, 

3a; 4* y - 2 « 4; x—y4-2«—1; 

(c) X 4- 2y « 1, (d) 2x 4- y 4- 32 4- *» -2, 

2x - y — 22 w 3, 6x -f 3y - 2 — tr « 1, 

4* y 4- 82 •« 2; X — 2y 4* 42 4 - 3io •* 4, 

3x — y -f 2 « 2. 
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2. Obtain non^ro solutioiiB when they exist. 
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(a) a; 4* 3y — 2* * 0, (&) x — 2y 0, 

2af — y4-ew0; 3a:-f|/-*0, 

2a: — 1 / -» 0; 

(c) 3a; — 23 / -I- * • 0, (d) 2a! — 4y 4- 3* «• 0, 

X + 2i/ — 22 «• 0, X 4- 2j/ — 22 » 0, 

2a; - y 4- 22 « 0; 3x - 2|/ + « « 0; 

(e) 4x - 2y 4- 2 - 0, (/) X 4- 2y 4- 22 - 0, 

2x — t/4*3z-*0, 3x-y4-z «0, 

2x ~ y — 2z «« 0, 2x -f Sy 4- 22 «* 0, 

6x - Sy 4* 4z « 0; x 4* 4y — 2z « 0. 

8 . Investigate the following systems and find solutions whenever the systems are 
consistent: 

(a) X - 2y « 3, (b) 2x + y - « * 1, 

2x4-y“l| x — 2y4“*®*3, 

3x — y « 4; 4x — 3y + * - 5; 

(c) 3x 4- 2y » 4, (d) 2x — y 4* 32 « 4, 

X - 3y 1, X 4- y - 32 « -1, 

2x 4- 5y « -I; 5x — y 4* 32 « 7. 

4 . (a) Give a necessary and sufficient condition that four points in space be coplazmr. 
(b) Give a necessary and sufficient condition that four planes be concurrent. 

5. As in Example 5, find a necessary and sufficient condition that four points lie on a 
circle. 

6. Give a relation which the coefficients must satisfy if 

ox* + 6x^ 4 cx 4* d « 0, 
ax* 4 /Sx 4 7 “ 0 

have a common root. 

7. Give a condition on the coefficients of a general cubic /(x) if it has a double root. 

and J\x) have a root in common. 

8 . The system ax 4 by « c, ax 4 /?y “ 7 represents two lines which may intersect 
at one point, may be parallel, or may coincide. Discuas the system geometrically, and 
thus obtain all the relevant results involving rank. Hint'. Begin by showing that the 
lines are paraDel if, and only if, the coefficient determinant is zero. 

9. An equation ox 4 by 4 “ d represents a plane, and two planes are parallel if, 

and only if, corresponding coefficients a, b, c are proportional: 

a « fcoi, h » fcbi, c » kci, 

(You may assume these geometric facts.) As in Prob. 8, give a complete geometric 
discussion of the behavior of two equations in three unknowns. 

10. As in Prob. 9, discuss the general system of three equations in three unknowns. 
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THE LAPLACE TRANSFORM 


The use of Laplace tranforms for solving ordinary differential equations 
has its origin in a symbolic method developed by the English engineer 
Oliver Heaviside. It enables one to solve many problems without going 
to the trouble of finding the general solution and then evaluating the arbi¬ 
trary constants. The procedure can be extended to systems of equations, 
to partial differential equations, and to integral equations, and it often 
yields results more readily than other techniques. 

1. Definition of the Laplace Transform. The function F{p) given by 

F{p)^rf(x)e-^^dx^L(f) (M) 

-'0 

is called the Laplace transform of /(x), and the operator L that transforms 
/ into F is called the Laplace transform operator, llie operator L is linear; 
that is, 

L{f+g)^L{f) + L{gL (L2) 

L(c/) = cL(/), (L3) 

where c is any constant. Indeed, the definition of L shows that (1-2) is 
equivalent to 

[ Ifi^) + dx - f f{x)e~^* dx + f g{x)e'~^^ dx 

JQ JQ Jq 

and this is a familiar property of integrals. The proof of (1-3) is similar. 

To illustrate the calculation of a Laplace transform let f{x) » e***, where 
a is constant. The transform is 

/ e^^e'^^dx = / «- 

J<y Jo — 

provided p > a. When p < a, the integral diverges. 

This example enables us to investigate the convergence of (1-1) for a 
general function/(x), provided 


(1-4) 
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f(z) is piecewise continuous ' on every finite interval 
and (1-5) 

\f{x) I < Me®* for some choice of the constants M and a. 

Under these conditions the integral converges for p > a, just as in the 
foregoing example. In fact, 

Jq dx < dx < M Jj’ dx. 

Since the latter integral has the finite value (1-4'i, the integral on the 
left remains bounded as Z oo. This establishes not only the conver¬ 
gence but the absolute convergence of the integral defining L(/). The 
convergence is uniform if p > oo > n, where oo is fixed, and hence the 
operations we shall carry out later are justified. 

The integral on the right of the foregoing inequality tends to zero as 
p —* 00 , This sliows that 

lim F{p) = 0 (1-6) 

P —>00 


for all functions F == L(J) such that / satisfies (1-5). It is found, more 
generally, that F(p) —> 0 if L(f) converges for any finite value p « po, 
even when (1-5) does not hold. Hence, if lirn F(p) 0 as p —> oo, then 
F{p) cannot be the I^^aplace transform of any function/(x). 

Example 1. Let/(x) « x*. The change of variable t « px yields 


f. 


x^e dx 




According to Chap. 2, Sec, 14, the latter integral is convergent for 6 > —1 and repre¬ 
sents the generalized factorial bl Hence 

L(x*) » for 6 > -1. (1-7) 

When b IS negative, j* is infinite at x « 0 and (1-5) does not hold. 

By comparing the integral for L(/) with that for L{Mx^) near the origin, one finds 
that (1-5) is really needed only for x > 1, provided /(x) is piecewise continuous for 
X > 0 and satisfies the additional condition 


|/(x) I < Mx* on 0 < X < 1 for some constant h > —1. 

Whenever we take a Laplace transform L(/) in the sequel, it is understood that p > a 
and that/ satisfies (1-5) or the more refined condition just descril>ed. On the other hand 
it is not required that/(x) be real For example, (1-4) holds when a is complex provided 

p > Re (a). 

' See Chap. 2, Sec. 26. The following discussion uses a comparison test for integrals, 
which can be verified in the same way as the corresponding tost for series. Cf. Chap. 2, 
Sec. 4, Theorem I, and Chap. 2, Sec. 6, Theorem I. 
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Bxampk 2. The choice a •• in (1>4) yields 


{APP. » 


!,(«**) • I,(oo8 6x + * sin hz) • 


|j — t6 

Upon equating real and imaginary parts with due regard to (1-2) we get 


P 

Is(oos bx) —z -L(sin hx) 

p* 4- 0 

for all real 6. Diflferentiation with respect to h gives 


L(x cos hx) 


p* -62 


h(x sin 6a;) 


6 

p‘ + 6* 
26p 


(p*+6*)» 


( 1 - 8 ) 


(1-0) 


Proceeding in this fashion one can construct a table of transforms, such as Table 2 
given at the end of this appendix. Indeed, we have already derived entries la, 2a, 26, 
36, and 4a of Table 2; and entry 3o can be obtained from (1-8) and (1-9), since L is linear. 


2. Some Uses of the Laplace Transform. If L[fix)] - F(p), integration 
by parts leads to 

Llf'ix)] ^ pF(p) - m (2-1) 

provided the hypothesis (1-5) applies to/'(a:) as well as tof(x). That is, 

OO 

f dx = e“*’*/(x) -f f pe^^^fix) dx. (2-2) 

'0 0 •'0 


For sufficiently large p Eq. (1-5) shows that er^'^fix) 0 as x —♦ oo, and 
the desired result follows. 

The choice/(x) ^ yin (2-1) gives 

L( 2 /') = pL{y) - y(0) (2-3) 

and the choice /(x) = y' gives 

L(y") =«= pL(y') — j/'(0) « p[pl>iy) - y{0)] - p'(0) 

in view of (2-3). Hence 

- py(0) - y\0). (2-4) 

The transform of the higher derivatives can be obtained similarly. For 
instance, 

t(y'") « p^L(y) - p^y(0) - pp'(O) ~ y'\0). (2-5) 

These relations enable us to solve differential equations with constant 
coefficients. 

As an illustration consider the problem 

+ y « fit), yiO) « |/'(0) «= 0, (2-6) 

which describes the response of a resonant circuit to an input /(/). To make 
the iproblem definite let/(i) » 0 for t < 0, but/(0 « 1 for ^ > 0. (A switch 
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to a coEkstant-voltage source is closed at time i « 0 and remains closed there¬ 
after.) The transform of (2-6) gives 

V^Liy) + Uv) 


when we use (2-4) and the entry la of Table 2 with a 
L(V), 

Uy) 


1 


p(p" +1) 


0 . Solving for 


It can be shown that a continuous function y is determined on (0,oo) 
as soon as its transform L(y) is known. Hence, the foregoing equation 
contains the solution implicitly. To find the solution explicitly we use 
partial fractions; thus 

V 1 P 

Uy) -- 

p p +1 


The entries la and 26 of Table 2 give the desired answer 

2/ = 1 — cos < for t > 0, y == 0 for i < 0. (2-7) 

It is an especial merit of the Laplace transform that the initial conditions 
are satisfied automatically. In the foregoing illustration we did not find 
the general solution and then determine the constants so that y(0) == y'(0) 
= 0. Nevertheless, the expression (2-7) satisfies these conditions, as the 
reader can verify. 

To illustrate further the introduction of initial conditions we shall solve 


2/'" - y' = sin X (2-8) 

subject to 

y{0) = 2, j^'(0) - 0, 2/"(0) « 1. (2-9) 

The Laplace transform of (2-8) yields 

p^Liy) - 2p^ - 1 - IpUy) - 2 ] == L(8in x) = {p^ + 1)“"^ 
when we use (2-5), (2-3), and entry 2a of Table 2. Solving for L(y), 


Uy) 

By oartial fractions 

Uy) - 


2p^ ~ 1 


+ 


(p*+i)(p® - p) 

3 P 


+ 


4(p + 1) ' 4(p - 1) ■ 2(p=* + 1) 
and entries la and 2i> of Table 2 give 

y ■» + H 008 *. 
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The Laplace transform can also be used to solve systems d. differential 
equations. As an illustration, let it be required to find y if 


y' + 2z' + y — 2 = 25, 

2y' + 2 == 25e*, 

with the initial conditions 

y(0) = 0, 2(0) = 25. 

The transform of (2-10) leads to 

pL(y) -f 2[pL(z) - 251 + Uu) - = 


25 


2pL(y) -f L(2) = 

which simplifies to 

(p 4- l)L(y) + (2p - 1)L(2) 


25 


p - 1 

25(2p + 1) 


Solving for L(y), we get 

L(y) 


P 

2pL{y) -f- L(2) = 25(p - 1)~‘. 
25 


4p(p - l)*(p + }4) 


( 2 - 10 ) 

(2-11) 


_ 25 9 5 16 

p p - 1 (p - 1)2 P+H 
According to’ entries la and 16 of Table 2, 

y = 25 - -f 5rc* - 


It should be not^ed that this method enables us to find y without finding 
2, Also no extraneous roots are introduced, and the initial conditions are 
satisfied automatically. 

PROBLEMS 


1. If y satisfies y" — 3y' -f 2y *» 4, y(0) « 2, y'(0) « 3, show that 


Uy) 


2p^ - 3p 4 
P(7» - 1)(P ~ 2) 


Deduce that y « 2 — 3c* -f 3c^. 

2. Solve by means of the Laplace transform 


V" + 4y « sin x, y(0) - 1, y'(0) - 0. 

8. Find L(y), and solve 


y-' + y^' « e* 4- X 4-1, y(0) - y^O) - y"(0) - 0, 
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4 . Find L(«) in Eq®. (2-10) and (2-11) of the text, and deduce that 
z » 33c* - 10x€* ~ 

6. Solve by means of the liaplace transform and check by substituting into the given 
system: 

X/' + 3?/ + 2 ' 4* 2z « c~^*, 2/(0) »« 0, 

2?/' 4- 22/ -f *' + 2 « 1, z(0) « 0. 

6. Find j/, given that 

y* ^ z' ^ z' w' w' 4- J/' *=» 2/, 2/(0) « z(0) « i/)(0) ■« 1. 


7. If /'(j) satisfies (1-5), show that/(x) satisfies a condition of the same type, though 
perhaps with a different value of a. Hint: f{x) *» f'(t) dt 4-/(0). 


3. Discontinuities. The Dirac Distribution. Closing a switch in an 
electrical circuit intrmiuces a discontinuity in the corresponding input 
function [cf. the discossion of (2-6)]. A disconti¬ 
nuity may also be produced by a sudden impulse 
in a mechanical system. The Laplace transform 
is a most effective means of dealing with such situ¬ 
ations, because the transform of many discontinu¬ 
ous functions is just as simple as L(c*) or L(sin x). 

Tn thivS section we shall consider the response 
of a system to an impulse function which acts 
over a very short time interval but produces a 
large effect. The physical situation is typified by 
a lightning stroke on a transmission line or by a 
hammer blow on a mechanical system. 

To formulate the idea of an impulse, let a be 
a small positive constant and let 6a(x) be the function illustrated in 
Fig. 1. That is, 

6o(x) = for 0 < X < a 

and 5o(x) =» 0 elsewhere. The Laplace transform is 



L[5«W] = - {par\l - ern. 

JO 


By the Taylor series for e 

I^l^a(x)] = 1 — }^(pa) d-1 

as a 0. It is customary to introduce an expression 6(x) which is thought 
to be the limit of 6a (x) as a —♦ 0 and to say that 

L[6(x)] - 1. (34) 

We call 6(x) the Dtrac distribution or the unit impulse^ and we take (3-1) 
as the basic defining property. The legitimacy of this procedure requires 
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further discussion, which will be given presently. First, however, the use 
of 5(x) will be illustrated by an example. 

The displacement y of b, weight suspended by a spring with stiffness 1 
is determined by the system 

y'" + y ^ fit), y{0) « y'(0) « 0, 

where f(t) » force function 
t == time 
' * d/dL 

To determine the response to a unit impulse at ^ 0 we replace f(t) by 

5(^); thus 

y" + y ^ 5 ( 0 . 

The Laplace transform yields 

p“L(s,) + L(y) = L[i(0] = 1 
when we use (3-1). Hence L(y) «!/(!+ p^), or 

y » sin t > 0. 

The initial conditions require y = 0 for ^ < 0, and the graph has the ap¬ 
pearance illustrated in the accompanying Fig. 2. 

The function y is continuous, 
but it is not differentiable at 
t ~ 0. Thus, the initial condition 
y\0) = 0 is satisfied. Indeed, 
y'(t) -*-> 0 as < 0 through nega¬ 

tive values, but y'ii) = cos t 1 
as f 0 through positive val¬ 
ues. The unit impulse produces 
a jump, of magnitude 1, in y\t). 

To investigate the meaning of the foregoing result, we solve 

y" + y ^ Sa(0, y(o) »» y'(0) - o 

and then let o —► 0. The general solution is 

y *« Co sin f + Cl cos f, f < 0, 
y “cjsinf-hcjcosf-f- a~\ 0 < t < a, 

y •» C 4 sm ( 4- Cft cos f, i> a. 

By the initial conditions, 

cq M Pi « 0, C8 «• —• 

a 

To determine ci and c% we require that y and y' be continuous at f « a. This gives 
— cos a 4“ * C4 sin a 4- ci cos a, 

8 ina««C 40 oea-<-C( 8 ma. 



Fio. 2 
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Hence Ci » sin cs •>» ©""Hco** "• 1). <««* aolutaon is 

y - 0, « < 0, 

y *« — oofl 0, 0 < « < a, 

y « 0 “^ sin o sin i — a~Hl ~ cos a) cos f, a <t 

Since a"*Kl •“ co® 0 S a~^(l “-oosa)”->0a8o~+0, and since a“^ sin a 1 as 
a 0, we see that letting o 0 gives the solution which was obtained previously by 
the method of Laplace transforms. 

Although 3(rr) is often called the ‘‘Dirac delta function,” it is not a 
function. Indeed, we have already observed that L(/) —► 0 as p oo for 
every function /, and 5 doe^ not have this prof>erty, because L(5) » 1. 
It is possible to generalize the concept of function and to generalize, cor^ 
respondingly, the definition of L. The process leads to a branch of mathe¬ 
matics known as the theory of distributions} In this theory manipulations 
with 6(x) of the type carried out in the foregoing discussion are fully 
justified. 

Although a brief and correct definition of the unit impulse 8{x) is not 
easily given, it is easy to define what is meant by the response of a system 
to the unit impulse. Namely, find the response to the function 5a(^), as 
in the foregoing example, and then let a 0. The Laplace transform 
gives the result of such a calculation directly, without introduction of 

PROBLEMS 

1. The volUge F of a certain circuit satisfies 

V" + 4F' + 3F » E{t) 

where E is the applied voltage. Find the response of the system to a unit impulse at 
f « 0 if F ■» 0 for ( < 0. 

2. (a) Solve the equations 

y' « Kx), y" « 6(x), 2 /'" » d{x) 

assuming that y «* 0 for x < 0 and that y and as many derivatives as possible are con¬ 
tinuous. (6) Show that y, y', and y" have a jump of value 1 at x « 0 in the three cases, 
respectively. 

3. A certain function f/(x) satisfies 

-fU = -yiKxh x>0 

where a and 0 are positive constants. It is known further that C/f—x) *• U(x)^ U is 
continuous, and 1/ —> 0 as x —♦ «». Obtain the solution 

Hint: In forming L(U"), take U(0) *» c, U'(0) « 0 where c is a suitably chosen constant 

^ L. Schwartz, th^orie de distribution,” Hermann & Cie, Paris, 1950, See also 
B. Friedman, ‘Principles and Techniques of Applied Mathematics,” chap. 3, John 
Wiley & Sons, Inc., New York, 1956. 
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4 . Tlie singiilar solution u(x,<) far heat conduction satasfies 
tt< w{*,0) - HKx). 

(a) If V(^,P) “ I'M# where the transform is with respect to t, show that — pV 
<■ — (f>) Using the result of Prob. 3 followed by Table 2, deduce that 


u(.x,t) 


_ A—xV (4a*0 

2a(rt)^ 


Hint: The role taken by in the table is taken by t in this problem. 


4. Additional Properties of the Transform. The usefulness of the La¬ 
place transform is greatly increased by the properties tabulated in Table 1. 
Entries la, lb, and 4a were derived in the foregoing discussion, and the 
others will be derived now. To deduce the relation 2a we have 


l^fix - c)e-^^dx = dt = e-’”’dt 

upon setting i ^ x — c. The limits (—^,00) can be changed to (0,co) if 
f{t) « 0 on the interval (—c,0), and 2a follows. In particular, 2a holds 
If c S: 0 and f(t) = 0 for < < 0. The relation 2b is simply the identity 

dx = Te-^V^/Cx) dx. 

Jo Jo 

This is valid without restriction on c, provided p is large enough. 

For 3a we let ( ^ cx to obtain 

jj{cx)e-^* dx = jj{t)e-^^'‘'^‘d Q = ^ ^ Q 

as desired, provided c > 0. Writing 1/c instead of c in 3a gives 3b, again 
for c > 0. 

The result 4b follows by differentiating (1-1). For 5a we apply 4a to 
the function 

/i(x) = [7(0 dt, 

Jo 

noting that /i(0) - 0 and that /i(x) = f(x) at points of continuity. The 
result 5b follows from integration of (1-1). 

The convolution theorem, item 6 in Table 1, can be established by the 
following device: Since the Laplace transform involves f{x) only on the 
range (0,«>), we can agree to take f{x) = 0 for all negative x. With a 
similar convention for g(x), the respective Laplace transforms may be 
written in the form ^ 

. ‘ Transforms of the type (4-1) are called bilateral, in contrast to the unilateral trans¬ 
form (l-l). An account of the bilateral Laplace transform may be found in B. Van 
der Pol and H. Bremmer, ^‘Operational Calculus,” Cambridge University Press, London, 
1950. 
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L(/) - /* dx, L(ff) - r e-^gix) dx, (4-1) 

and the function A(x) of Table 1, entry 6, is equal to 

Kx) ~ r mg{x - & dl (4-2) 

» —00 

Indeed, tlie lower limit —oo in (4-2) may be replaced by ssero because/({) 
« 0 when f is negative, and the upper limit may be replaced by x because 
gix — {) 0 when x £ is negative. Given (4-1) and (4-2), the convolu¬ 

tion theorem L{h) = L(J)L(g) can be proved by a discussion which is 
practically identical with a discussion given previously, and hence we do 
not repeat the argument here.^ 

Example: Periodic Functions. Let Poix) be the function illustrated in Fig. 3, bo that 
Pq(x) « 1 for 0 < X < a, Poix) « 0 elsewhere. 

Direct computation gives the transform 

LlFo(x)l « r e-^dx ^ p'-\\ - 

h 

If the function is translated c units to the 
right, as shown in Fig. 4, the result is 

UPo(x ~ c)l « (4-3) 

by Table 1, entry 2a. Upon choosing c « 0, 
c -» 6, c « 26, c » 36, ... and adding, we get 
a square wave * y « F(x). According to (4-3) 
the Laplace transform is 

LlP(ar)J = p-'(l - + e"*** + e-’r* +•.-)- p-» 

when we recall the formula 1/(1 r) for sum of a geometric series (Chap. 2, Sec. 1). 




^ See Chap. 6, Sec. 18. In the present case the integrals are absolutely convergent, 
the change in order of integration is justified, and the process actually gives a valid 
proof. 

* See Fig. 5. It is left for the reader to sketch the graph when 6 «■ a and when 6 < a. 
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The piocedure just described can be applied to any periodic function P(x) and yields 
the formula 

LlPWl « (1 - L(Po(x)l 

where b is the period and where Foix) « P(x) on 0 < x < b but Po(x) - 0 elsewhere. 



For example^ the reader can verify that the transform of the sawtooth imve shown in 
Fig. 6 is 

(1 ~ - e*-*P(l + bp)]. (4-4) 



PROBLEMS 

!• Find a function f(x) whose transform is 

2p - 5 

3p* + 12p + 8* 

Hint: By completing the square the expression can be written in the form 

2p - 5 2(p + 2) - 9 2 p + 2 3 

3(p -f 2)* - 4 " 3(p H- 2)2 - 4 * 3 (p -f 2)* - ^ (p -f 2)* - 

Use Table 2, entries 2o and 26, with a «• 2i/y/S (see also Table 2, entry 7). Then 
use Table 1, entry 25, with c « —2. 

2 . Aj in Prob. 1 find a function whose transform is 

8. Derive Table 2, entry 15, from Table 2, entry 4o. 
i. Derive Table 2, 6o, from Table 2, 66, and Table I, 46. 

5, Derive Table 2. 46, from Table 2, 2a, and Table 1, 66, 
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B. aad j{*) »• i*“‘, rf»ow that 

f*g m. x«+»-i («. 


Let £ ia; in the definition otf^g (Table 1, entry 6). 

7. From Prob. fi deduce the Evler formula for the beta function 





(a - l)!(b 1)1 

(0 + 6-1)! 


Hint: By the convolution theorem applied to the result of Prob. 6, 


6. Steady-state Solutions. The Laplace transform will now be used to 
solve the general linear equation with coastant coefficients, 

H-h aiy' + Ooy = f{x), (5-1) 


subject to the initial conditions 

y(0) « 0, y'(0) « 0, ..= 0. (5-2) 

The solution satisfying (5-2) is called the steady-state solution of (5-1), 
because in many physical problems the effect of the initial conditions decays 
exponentially as x increases. 

By repeated use of Table 1, entry 4a, 

L[y<*>] = p'‘Uy), (6^) 

for fc = 0, 1, 2, ..., n, provided (5-2) holds. Hence the transform of (5-1) 
yields G(p)L(y) = L(Jf), or 

L(y) = “ Uf), (5-4) 

G(p) 

where G(p) = p“ -f o„_ip"“^ -1-f- aiP + ao- 

Determination of y from (5-4) is especially easy when G(p) has only simple 
roots pk 9^ 0. Indeed, expanding l/(?(p) in partial fractions leads to 

L(y) = L(/)S-^ (5-5) 

P - pk 

where the AkS are constant.* Since Table 2, entry la, gives 

2 - AL - = SA*L(e'’»*) = L(2.4jfce»’**), 

P -Vk 

Eq. (5-6) may be written 

L(p) « L(/)L(SAte*'**). 


> If we multiply through by p — p* and let p p*, it is found that l/A* - 0'(pu). 
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CcBBpariiig with Table 1, entiy 6, gives Heaviside’s ecpansim theorem 

y df. (6-6) 

The eeaenoe of the method is that (5-4) leads to 

Uy) - L(/)Lfe) 

provided is a function such that l/0(p) * hig). By the convolution theorem, 
y ’‘S*g - j^M)g(v - & di- 

This formula is valid even when G{p) has multiple roots, though the determination of 
g may then be more difficult. 

The function g can be thought to be the steadynatate solution of 

g^*^ -{- “1-h uu?' -f ooflF - 5(a:) (5-7) 

because the transform of (5-7) yields Gip)'L{g) » 1. However, since we have not de¬ 
veloped the theory of distributions, it is better to avoid the use of &{x). This question 
will be discussed next. 


Let h{x) be the steady-state solution of 

fc'"’ + + • • • + a,ft' + ooft = Hx) 

where 7(x) denotes the Heaviside unit function: 

I(x) >“0 forx < 0, /(i) = 1 forx > 0. 

The Laplace transform of (5-8) yields G(p)L(ft) = 1/p, so that 


L(ft) 


1 

pG(,p) 


Writing (5-4) in the form 


My) pL(/) 


we obtain 


pG{p) 


L(y) = [L(/') -h/(0)]L(ft) 


= pL(/)L(ft) 
L(/')L(ft) -f/(0)L(ft) 


(5-8) 


by Table 1, entry 4a. The convolution theorem now yields 

y - l^nmx - f) + /(0)ft(x). (5-9) 

Thus, the eteady-‘Staie solution of (5-1) can be obtained from the steady-state 
solution of (5-8) by means of the formula (5-9). This important fact is 
known as the mperposition principle. 
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As in the derivation of (2-4) one can show that 

L(j,c*)) « _ p*~yO) ^ - 

By means of this formula the Laplace transform can be used to solve (5-1) subject to 
the general initial conditions 

v(0) •* yo, y'(0) - yi, ..y^*"“^^(0) - y^^i. 

It should be emphasized, however, that the superposition principle applies to steady- 
state solutions only. 


PROBLEMS 


1. Find the steady-state solution of 

y"-b3y'4'2y-./(x) 

by use of Heaviside’s expansion theorem. 

2. Evaluate the result of Prob. 1 explicitly when 

(a)f(x) - I(x), (b)f(x) - ««»•, (c)f(z) - X. 

3. By means of the superposition principle obtain the solutions (6) and (c) in Prob. 2 
from the solution (o). 

6. Integral Equations, An equation of the type 

g(x) = X/(x) +jj{i)k{x - i) rff (■►>!) 


where X is constant is called an integral equation. It is supposed that g 
and k are known and that / is to be found. Because of its close relation 
to the convolution theorem, tliis equation lends itself to analysis by means 
of the Laplace transform. Indeed, taking the transform of (6-1) yields 


Uq) * XL(/) + UDUk) 

when we use the convolution theo¬ 
rem. Hence 

Ug) 


W) 


X “j“ L(/c) 


and from this, f can often be found. 

The process will now be illustrated 
by an example. 

Starting from rest, a particle slides do^m 
a frictionless curve under gravity (see Fig. 

7). It is required to determine the shape of 

the curve so that the time of descent will be independent of the starting point. 
A curve of this sort is called a taulochrme. As we shall see presently, the only 
tautoohrones are cycloids.* 



* For another interesting property of the cycloid see Chap. 3, Sec. 14, Prob. 3. 
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If the pturtide starts at a hdg^t v> its -veiooHy p when the is yi •> 9 can be 
found by equating potential and kinetic energies. The result is 

— ij) or e *« W)^(y tj)^ (6-2) 

where g is the acceleration of gravity. Denoting the arc along the curve by «, we see 
that the time for descent is 

where /(n) stands for ds/dy at y * v- Since the timeJs constant and since v is given 
by (^2), the problem reduces to 

^/(v)(l/ - dll Co 

where cq is constant. This is an integral equation for /. 

Taking the Laplace transform gives L(/)L(y"~^) *■ L(cc), or 

- cop-'. 

This gives h(f) ■■ where ci is constant, and hence /(y) •« cyT^, where c is con¬ 

stant. Thus we are led to the differential equation 

If we set y a* c* sin* a short calculation yields 

z « }4f^(<t> + sin 4>)i y “* }^(l - cos <t>) 
which are the parametric equations of a cycloid. 


REVIEW PROBLEMS 

1. The current / in an RL circuit satisfies 

L^I + m.v 

where V « V(i) is the applied voltage. At time < «■ 0 a switch is closed, so that V 
suddenly assumes the value Fo + A sin (Here L, R, F©, A , and w are constants.) 
By use of the Laplace transform find / for ^ > 0. 

2. Find the response of the circuit in Prob. I to a unit impulse at time t « 0, assuming 
that F « 0 for i < 0. 

3. Find the steady-state solution in Prob. 1 when F is an arbitrary function by (a) 
the Heaviside expansion theorem, (6) the superposition principle. 

4. If L(y) « F(p) use Table 1, entry 4h, to obtain 

L(xy) • -F', L(a;y') « -(pF)\ L(xy") - + y(0). 

6. A function y satisfies a;y" 4* 1/' 4- ify “ 0 and has a Laplace transform L(y) » F(p), 
By use of Prob. 4 show that 

F'(l+P*)- -pF, 
and thus deduce that y cJo(x) where c is constant. 
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0. An inBulated red extending along the positive x axis is initially at temperature 0, 
and the end » * 0 has the temperature fit) at time L The temperature uixfi satisfies 

Ut « uixfi) «• 0, ti(0,0 « fit). 

(o) If Uix^p) is the transform of u with respect to t, show that 

U - 

[Assume that t/ —► 0 as x —► and note that C/i0,p) « L(/).] 

( 6 ) By writing U « L(/)L(^), where g is found from Table 2, deduce that 


uix,t) 


2 a\/ 


^/Wdr. 


r Jo (< — t )^ 


7. Use the Laplace transform to solve some of the text examples and problems in 
Chap. 1 , Secs. 21 to 26. 


Table 1. Pbopekties of L[/(x)} « Fip) 


i 

1 

a 

b 

1 

Uf + (?) = L(/) + tig) 

Uef) = cUf) 

2 

L(/(i - f)J - e-^‘F(p) 

Fip -c)~ L[e«/(x)) 

3 

Ll/(cx)] =■ J f (5 

rw-l[!/0] 

4 

L[/'(x)) - pF{p) -m 

F'{p) - t[-xf(x)] 

5 



6 

L(/)L( 0 ) - L(?i) where h{x) - [/(OsCx - £) di 

Jo 












APPENDIX C 


COMPARISON OF THE RIEMANN AND 
LEBESGUE INTEGRALS 


1* The Riemann Integral. Let a function f(z) be given on the interval 
< a: < 6 (Fig. 1). To define the Riemann integral 

jj{x)dx (1-1) 

we divide the interval [o,6] into smaDer intervals by points x*, 

a = Xq < Xi < X2 ■ - < Xn h. 



It will be desirable to consider a sequence of subdivisions which are naade 
finer and finer by clioosing more and more points x*. The precise require¬ 


ment 18 


and 


max 

k 


Xk - Xk-^i\ 


0 . 


To describe this situation we say, in brief, that the subdivision becomes 
arbitrarily fine. 

Let ik be an arbitrary point on the interval With yk « /({*) 

as shown in Fig. 1, the sum 

8 = 2/1 (Xi - Xo) + 2/2(X2 - II) H -1- Vnixn ~ X^-l) (1-2) 

771 
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represents a certain area that presumably approximates the area under the 
curve y f{x). The geometric interpretation suggests that s has a unique 
limit So, independent of the manner of subdivision, provided the sub¬ 
division becomes arbitrarily fine. When s 
actually does have this behavior, f{x) is 
said to be Riemann integrablef and the 
limit So is called the Riemann integral. 

The Riemann integral does not exist if f(x) 
oscillates too violently. For example, let a «« 0, 
6 « 1 and define 

/(x) » 2 for X rational 

(1-3) 

/(x) 3 for X irrational. 

It is easily shown that every interval (no matter 
how small) contains both rational and irrational 
numbers, so that the graph of /(x) has the appear¬ 
ance suggested by Fig. 2. If we choose rational, 
then f{^k) ** 2 and 

8 « 2(Xi - Xo) -f 2(X2 ~Xi) ■+•••« 

-f 2(x„ - Xn_i) 2(x„ - Xo) « 2 

no matter how fine the subdivision may be. 
On the other hand, if the kks are all irrational, 
then 8 « 3. This shows that the limit of s depends 
on the manlier of subdivision and, hence, that the 
Riemann integral does not exist. As we shall see 
presently, the Lebesgue integral for tliis function 
does exist and can be evaluated explicitly. 

2. Measure. The decisive idea in the Lebesgue integral is the notion 
of measuref which will now be described. The measure of an open * interval 
a < X Kbis simply the length h — a. If a set consists of a finite collection 
of such intervals (Fig. 3), the measure is the sum of the lengtlis. The 



Fig. 3 

same definition applies when there are infinitely many intervals. The 
sum of the lengths is now an infinite series, but since the terms are positive, 
the sum does not depend on the order of the terms (Chap. 2, Sec. 6, Theorem 
III). Thus, the measure is well defined in this case also. 

^ A rational number is a fraction p/g where p and q are integers. Thus }i and 
are rational, but VS is not. 

* An intervcd is open if the end points do not belong to the interval and dosed if they 
do. Thus a < X ^ h is a closed interval. 
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The DotioD of measure can be extended to still more g^eral sets E 
as follows. Let / be a collection of open intervals which contains • E, 
and let m(J) denote the measure of 1. We approximate E better and better 
by these sets I, so that m(/) becomes smaller and smaller. The smallest 
value for m(/) which is given by this process is called the outer measure 
of E and is denoted by 7no(E). 

Strictly speaking, the “smallest value” need not be attained, and the precise definition 
of outer measure is as follows: The outer measure is the largest number c such that 
m(/) > c for all sets I of the above-described type. The number c is called the greatest 
lower bound of the numbers m(/); its existence can be established by the fundamental 
principle quoted in Chap. 2, Sec. 1. 

A collection of open intervals, such as / in the foregoing discussion, is 
called an open set. As we have seen, outer measure is defined by consider¬ 
ing the open sets containing E. The points of [tt,h] not belonging to a given 
open set form a closed set. By considering closed sets contained in E one 
can define the inner measure m^(E). If mi(E) ~ moiE), the set E is said 
to be measurable and the common value is called the measure of E. 

To illustrate the calculation of a measure, let the set E consist of the rational points 
a? on 0 < X < 1, that is, the points whose coordinate z is a rational number. By taking 
first the rational numbers p/q with denonunator ^ — 1 , then those with 5 « 2 , and 
80 on, we see that the rational numbers can be arranged in a sequence 

ri, r% rs, ..., r„,- (2-1) 

Given € > 0, construct an open interval of length t/2 centered at ri, an interval of 
length 6/2^ centered at r 2 , and so on. Th^ nth interval is of length e/2" and is centered 
at rn. If I denotes the set consisting of all these open intervals, then 

m(/) +(2-2) 

[We have inequality rather than equality in (2-2) because some of the intervals may 
overlap.] 

The foregoing construction shows that the outer measure of £* is < e. Since e is 
arbitrary, the outer measure must be zero. Because mx(E) ^ mo(E)y it follows that the 
inner measure is also zero and, hence, m{E) «« 0. 

As a second illustration we shall find where E^ is the set of all irrational numbers 
on [0,1]. On© of the most important properties of measure is that it is additive; if E 
and E' are two measurable sets with no point in common, then 

m{E -f E') - m(E) + m{E'). 

(We use E -h E' as an abbreviation for the set of all the points belonging either to B 
or to E\) In the present case E is the set of rational points on [0,1], and E' the set of 
irrational points on [0,1]. Evidently E + E' is the set of ail points on [0,1], so that 
m{E + EO * 1. The above equation then gives 

m(E') » 1 — m(E) « 1 — 0 »» 1. 

' That is, every point of E is interior to one of the intervals belonging to the set /. 
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[app. c 


S* The liebesgue Integral, A function y ^ fix) is said to be measurable 
if the set of points x at which f{x) < cis measurable for any and all choices 
of the constant c. It can be shown that the set Sfc at which <f(x) < yu 
is then meaaurable for all choices of yk^x and j / a *. To define the Lebesgue 
integral of /(x), let the y axis be subdivided by points yk as shown in Fig. 4, 
and form the sum 


(T » yim{ei) + y2m{e2) H-h 



When fix) is measurable and bounded, the sum or has a unique limit ao, 
independent of the manner of subdivision, provided the subdivision be¬ 
comes arbitrarily fine. This limit (tq is called the Lebesgue integral of f(x) 
and is written in the form (1-1). 

The most obvious difference between Riemann's definition and I^e- 
besgue’s is that in the former the x axis and in the latter the y axis is sub¬ 
divided. This distinction, however, is superficial. The important fact is 
that Riemann^s definition is based on the notion length of an interval 
whereas Lebesgue^s is based on the more general notion, measure of a set. 
The intervals Xk — in Riemann^s definition play the same role as the 
sets Bk in Lebesgue's. 

Riemann^s definition breaks down if f{x) does not remain close to yk 
throughout most of the intervals [xjb_i,Xfc]. Lebesgue’s definition cannot 
break down in this way, because f(x) is automatically close to yk through¬ 
out the set Bk, That is why (in contrast to the former definition) the 
latter carries with it an assertion that the integral actually exists. 
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SEC. 3] 

To illustrate the calculation of a Lebesgue integral we shall integrate the function 
(1-3) illustrated in Fig. 2. If the intervals (Vp^hVp) and (yg^uVg) contain 2 and 3, 
respectively (Fig. 5), then the sets ep and are the only ones that are not empty. Thus 
m(ejk) ^ 0 for k p QT q, and the sum reduces to 

<r « Vptniep) -f ygm(eg). 

Since ep is the set of rational points and the set of irrational points, these sets have 
the measures 0 and 1, respectively. Hence cr As the subdivision becomes 

arbitrarily fine, i/g —► 3 and the Lebesgue integral is found to be 


f\(x) dx - 3. (3-1) 

Jo 

It can be shown that if the Riemann 
integral exists, then the Lebesgue integral 
exists also and the two have the same 
value. On the other hand, the latter may 
exist when the former does not, as we 
have just seen. Because of its greater 
generality the Lebesgue integral has 
many desirable properties, of which we 
mention the following: 

Lebesgue Theorem on Bounded Con¬ 
vergence. Suppose l/n(T) I < M where 
M is constantf suppose fnM are Lebesgue 
integrahle, and suppose lim fnix) = f{x) on 
an interval [a,5]. Then fix) is Lcbesgiie 
integrahle^ and 

fh rb 

lira / fn{x) dx = I f(x) dx. 

Ja Ja 

To see why the theorem fails for Riemann 
integrals, let /„(x) « 2 at the first n rational 
points Tk in the sequence (2-1) and /n(x) « 3 
elsewhere. Then |/n(x) I < 3, and as a Riemann 
or Lebesgue inUigral, 

f/n(x) dx * 3. (3-2) 

Jo 



Fig. 5 


Evidently, lim/n(x) “/(x), where/(x) is the function (1-3). Taking the limit of the 
expression (3-2) as n —> 00 , we get 

Urn f/«(x) dx - 3 - f/(x) dx (3-3) 

Jo Jo 


provided the latter integral is the Lebesgue integral (3-1). Equation (3-3) does not 
hold for Riemann integration because, as we have seen, /(x) is not Riemann integrable. 
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TABLE OF *(*) * f’ <«' 



X 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 


.03X9 0.0359 
.0714 0.0763 
.1103 0.1141 
.1480 0.1517 
.1844 0.1879 
.2190 0.2224 

.2517 0.2549 
.2823 0.2852 
.3106 0.3133 
.3365 0.3389 
.3599 0.3621 

.3810 0.3830 
.3997 0 4015 
.4162 0.4177 
.4306 0.4319 
.4429 0.4441 

4535 0.4545 
.4625 0.4633 
.4699 0.4706 
.4761 0.4767 
.4812 0.4817 



0,4192433 

0.4452007 

0.4640697 

0.4918025 

0.4953388 

0,4074449 

0.4996631 

0.4998409 

0.4999277 

0.4999946 

0.4999979 

0.4999992 


* This table is reproduced by permission from the “Biometrica Tables for Statisti¬ 
cians,” vol, 1, 1954, edited by E. 8. Pearson and H. O- Hartley and published by the 
Cambridge Univeiqpity Press for the Biometrics Trustees. 
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ANSWERS 


CHAPTER 1 

Section 1, Pages 10-11 

1 . Ordinary, fourth order. 2 . Partial, fourth order, 3. Ordinary, first order. 

4. Ordinary, first order. 6 , Ordinary, seeond order. 6 . Partial, second order. 

7. Ordinary, seeond order. 8 . Ordinary, tliird order. 

14. y * y « -f 1; 2/ « ~ 2. 

X® x^ 

15. y « ^ 4- a:; y « 4- dx + C2. 

o o 

Section 3, Pages 13-14 

1 . 

B, p ^ -k; a * kc/{k\ — k), 

30 

6. V « SOe""**; s « (1 — 

K 

7. 2 m, 

Section 4, Page 16 

2. The rate q would be thought of s^sfiDq instead, where f(t) «= 0 for 0 < < < io and 
fit) » 1 for t > Equation (4-5) would be written dx/dt «= wr -j- fit)q — rx/g. 

3. dx/dt « w -f kAoe-^^ - rx/g, 6 . (A - x)/{B - x) » 

6. Let X represent amount of substance dissolved after time f, A the amount of sub¬ 
stance present when x * 0, f »* 0, and c the proportionality constant. If v is the 
volume of solvent and S the saturate concentration, then dx/dt »« ciA — x) X 
(S — x/v) if the dissolving substance does not change the volume v of the solvent. 

Section 6 , Page 18 

1. sin”"^ X — sin "^ y ^ c. 2. (y — ])/(y + 1) =* 

3. 2 cos y — sin X cos x -f- x « c. 4. (sec x — tan y) « 0, 

(X.y) 

» 0. 6. (y + + 1) « 2. 

( 0 , 1 ) 




Section 7, Page 20 




3. ain - *f log X « c. 
z 


2. sin”^ - — log X — c. 

X 

4. I* - 2xv - ir» - -2. 
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ANSWEBS 


«. logy -f: 


6. y ~ a: -f y log X - 0, 


7, 2 tan”** (e*0 -f log tanh; 

* 

9. » - 

11. log X -f c*“*'^* « c. 

Section Page 22 

1. «* + « + y ” c. 

xy » c, 

7, Not exact. 

10. Not exact. 

Section 9, Page 2B 

7. y/z + X » c. 


8. y « 

10, 1 — 1 _ log y « c. 
y 

12. y(2 - log y) » tan® x -f* c. 


2. Not exact. 

5. sin (y/x) *■ c. 
8. X® + sin xy » 


8. X = cye*^ 


10. y — 2 tan""* (x/y) « c. 11. (x/y)e^ 
Section 10, Page 26 

1. 1 + Vi* + 1 - cxe-o'^'. 

8. V - e-^(x - 1). 

6. y *■ cos® X + 2(sin z — 1). 


8. y »• sin X -f ce*, 

10. X - 1 -f 

12. y « 

18. X sin""* X 4* V^l — x® 

Section 11, Page 27 

1. y « cic®*; y » cje""*. 
— 2 

8. y « —r— ; X « 2p 


8. xV — xy® »« c. 

6. X® 4- y* ** c. 

9. x®y 4“ xy® + X 


9. yx® *» ce*'. 

12. y 4- x®/y « c. 


o 

• ® “ 3(1* + !)■ 

4. V - 1 - 2e-**'* 

2 . 

6. y ** 2 sin X — X cos x + ~ cos x 4” • 
9. * - ce(i/i)V5:^_ 

11. x(l + V)” » c. 


- «!»(V - 1) - 0. 


2. y ~ X + a; y ^ X + C 2 . 


4. » - log (p* + 2p); 2 - - ^ ^ tan ‘ + c. 

c®x 4" I n. « • 1 

g, y --; y ■» 2 Vx. 6. sm”*^ y rfc x « c. 

c 

7. y «» ce*; y « c — X®. 8. y - c* -f c; y •« c - x®/2. 

Section 12, Pages 28-29 

1. y* *• (48x"*® — 96x"“^ — 4) cos x 4- (16x~* — 96x~®) sin x 4- cx“~*. 

2. y""® «i X 4- H + c<5®*. 8. y“^ - 4~ ex®. 

4t. z y log cz. 6. y""* « 1 4- log x -f ca?. 


8. — -3 
ax 


7 « — — ;x«u~3, y-p4“4. 

au 2u 4“ 


■; » 3x 4- y “f 7. 


. tt — s 

' sm —;— ; X 


w — -1 y « 

2 2 
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10 . 


du 


— I *• cos w; w ^ a: 4-!/. 
' -f 


dx 
12. 2x 

14, a; sec y «• log I sec y + tan y 1 + c. 


11 . a: 2 « y ^ ^ ^ 


16, tan"*^ y — tan“ 
19. e*-“v « c X. 
28. 1/ « + ce®*. 


x ** c. 17. 4x1/ -f j/^ 


13. y « log (x?/ - c). 
15. ytan~^ 

« c. 


20 . 

24. 


- 2/ -h + ce**'. 

2x2c*' + cx2. 


X ■» c. 

18. xsin2^ 


21. 4x « 2j/ - 1 4- ce-“2v. 


Sectioa 14, Page 33 

1. y » CX. 2. x2 — j/* ■« C. 

3. x2 4 ny2 » c. 6. i9 « c. 

9. Self-orthogonal family. 

10. x2 — 2ax 4 s/^ “ 0; 2xyy' 4 x^ -> « O; a family of orthogonal riirvea is 

x2 - 2ay 4 ?/* “ 0. 


Section 16, Page 34 

1. (a) y « cx; (6) r- 4 i/ « r^. 

2. (a) 0 ± z - Vi- y^ - loK |(1 + Vl -"/Vi,); (h) v - re'. 

3. (a) j/ « (^) y « cosh x. 

4. s/'+'o _ c(z + Vz^TV)' 6. t =. |xo|6/{fc= - a^); h|xu' - .. 

Section 19, Page 46 

1. t «* 100/^ sec. 

6. tan $ »» tan 0o 4 2eE/(carnoc). 


Section 20, Pages 49-50 

2. V » V2j//i, s « > 2 ( 7^2 sin 
4. t; « tJi(l - (t>M)2 « 1 


6. y 


2vq cos^ a 


x* 4 a: tan a. 




Section 21, Page 54 

2. y « —c' 4 1/ «» 0. 4. y —2x2 _j_ ^ 


6. 3/ ■» c2* -j- 2x62*^; y « 0. 


Section 22, Page 56 

1. y » 4 Cue®*. 

4l. y c\^ 4 C2e“"2®. 
6. y a* cic** 4 C2xe2*. 


2. y *» cjc®* 4 3. y » cje* 4 caxc*. 

b, y ^ c\ cos 2x 4 C 2 sin 2x. 

7. y =“ cie^' cv)8 x 4 C 2 e^* sin x. 


Section 23, Page 68 

1. y « cic“* 4 2. y 

4. y »• cie®* 4 w** 6. y 

Section 24, Page 63 

1. y • ci«»* 4 cjje** 4 

3. y •» cie*”** 4 C 2 €"'** 4 Hac*- 

5. y « cic* 4 C 2 C“"* — 5x 4 2. 

7. y « (ci 4 C 2 x)e® 4 xV/6. 

9. y * Cl 4 4 x/3. 


f ic* 4 C2.e 3. y *s cie^ 4 C 2 c”^. 

cie“* 4 C2xe”"*. 

2. y « (ci 4 C 2 x)c‘“* 4 X - 2. 

4. y « (ci 4 C 2 x)c* 4x42. 

6 . y « ci€^ 4 C 2 C""® 4 c**(x/3 ~ K). 

8. y « cic®^ 4 cjxc** 4 x^c**/2. 
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10. y 

11. y 

13. y 

14. y 

le. y 

18 * y 
20. y 
22, y 


ANSWBRB 

9a;2 _ jgj. ^ 7 

•»» Cl sin Sa? -f c* coa 8 a; H-jr;- 

81 

« cic* + Cie'~* 4“ a»*/2. 12. y « 

a* cic®* 4- 4 c*® f -a;® — 3a;^ — 6x 



/X® x®\ 


— ci«* + + e* 


16. y 


13x® 24x 


.i, 

- Cl + «*«“ —- - 


17. y 

5 

25 125 


- 1 - 

X. 

10. y 

- (3i - 4)/9. 


21. y 


« 2e““* - 5€"'*V9 4 a;/3 - %. 


n fdn a; 4 Ca coa a; 4 ap® — 5®. 


)■ 


cie^* 4 4 


2a;® 4 6 x 4 3 


-4e-“* 4 4 2<!*. 


0 . 

0 . 


Section 26, Page 66 

1. j/ « cic® 4 C 2 €®* — (3 sin 2± 4 cos 2x)/20. 

2. y » Cl sin 2x 4 C 2 cos 2 x — (cos 3x)/5. 

8, 1/ cic* 4 4 2 sin X. 

4. y « ci«-®* 4 4 3xc-®^ 4 c*730. 

8. y « c~*(ci sin 2x 4 ca cos 2x) 4 sio 2x — >fo cos 2x). 

6. y » Cie®* 4 — He®*' cos 3x 

7. y - cie-** 4 4 xc^VlO - x®/25 4 4x/25 - % 25 . 

8. y » Cl sin x 4 C 2 cos x 4 % cos 3x — sin 2x. 

9. y «» -e*/4 4 - }>i sin x. 

10 . y - 0 . 

11. y * -34 4 Hoc*"" + e"'^(Ko cos x - H sin x). 

12. y — cos X 4 (x/2) sin x 4 1. 


Section 26, Page 70 


1* V 

2. y 

8 . y 
4. y 
6. y 

6. y 

7. y 

9. y . 

10 . y . 


« Cl 4 4 cse®*®. 

« Cl sin X 4 C 2 cos X 4 e~'®(c 3 cos 2x 4 C 4 sin 2x). 
' Cie"*^ 4 C2xe“* 4 c^x^e'^^, 

' 4 C 2 e* sin (\/3 x) 4 cje* cos (\/3 x). 

' (ci 4 C 2 x)c*' 4 C 3 . 

' (ci 4 C 2 X 4 C 8 X®)c"*' 4 C 4 . 

« Cl cos kz 4 C 2 sin kx 4 Ca cosh kx 4^ sinh kx. 


■ Cl 4 ^C2 


Vl5, ■ . \/i5 

cos ~ X 4 C 3 sin ^ 
2 2 > 


4 4- “ 4 


■( 


Cl sin - 


i 


4 fc 


4 

2 


rs sin x 4 C 4 cos 


V5 


4 2 cos X. 

11. y * cic* 4 C 2 C®^ 4 cgxc®* — x®/4 — x — 

12. y *• cic"'* 4 C 2 e'*' 4 csc®* 4 H + xc*/2. 

18. y « c-”®*/3 4 17cVl2 4 7c'~V2 - - 2x* 4 2x - 6. 


Section 27, Page 72 

1. Dependent. 

4. Dependent. 

7. Independent. 


2. Independent. 
6. Independent. 


8. Independent. 
6. Dependent. 
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Section 28, Pages 76-78 

1 . (o) 2 / ■» (a?® — a 7®)/3 — 2a;/9 — ?^ 7 ; ( 6 ) y « eV 12 ; (c) j/ » x 4 - 2 ; (rf) y «• ain x, 
8 . y “• '*(x®logx)/9, 4. y - cie* 4- caa; + x* 4 1. 

Section 29, Page 77 

8, v’ * c/lx\l - x*)]. 

Section 30, Page 79 

1. 2/ « CiX""® 4* C 2 X“'^ + K log X - 

2, y 0 ^ Cix^ 4* “f — 3ij* 

3, 2/ « 4 + a4/3. 

4. 2 / « rix* 4- C 2 X — x[(log x)^/2 4- log x|. 

6. y »*» cix* 4* C 2 X®. 6, 2/ « cix" + C 2 X“"’"^. 

Section 32, Pages 86-86 

1, 2 / *“ 2 cos \Al3 <, ; ?/ * 2 cos \/l0 t 4* V^IO sin VlO t. 

27r 

3. 1 / « 10 cos \/245 t. 

5 __ _ 

4. 2 / « 10^~^'(cos v/2^0/ •!->-—sin \^220t);R « 400-\/245 dynes. 

v220 

6. y » ie0\/2 e-““‘ cos ^rmt - ; r - + 500\/2 0. 

6. V - 20\/5 «-W''»x«(5 sinh IO.OOOn/o t + \/5 «>sh 10,000-y/S (). 
d^y ^ ^ 

10. 10 —^2 4“ lOfi^i/ » 0; max y «* VS, drop 2 4* V^- 

Section 36, Pages 99-100 

1. 2 / ~ Cl cos t 4- C 2 v«»in t; x ^ c\ sin i — cos L 

2. y “ Cif* 4- C2€"^; x =» cic^ — r2C~^ 

8. 2/ » c'(ri 4- C 2 O; X « e^(ci 4* C 2/2 4- C 2 /). 

4. y =» cic^ 4 - C 2 €“* 4- cg cos 1 4- C 4 sin f; X =» 4“ C 2 e"' — cg cos t — C 4 ain t. 

6. 2 / «“ Ci(l 4* 4 '\/2 )c x ~ cjc'^^ 4 C 2 e"'^^^ 

6. y » J- 4 - 

Cl (^-~^) + C2 + 2~ “ r 

9. Cycloid of radius mE/(elP). 

Section 36, Page 106 

1. yi * cie~^ 4- C2e®^; 2/2 * 2(r2C®* - cic~^). 


CHAPTER 2 

Section 2, Page 118 

8. 1(c), 1(c) for X « 0, ±ir, d=27r, ...; none in Prob, 2. 

4 . 1 (6), 1(c) for X 0, ±:tr, db27r, .., 2(a), 2(d). 

6. 1(a), 1(d); 2(6), 2(c), 2(/): 7. (a) Yes; (6) no. 
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Section 3, Pageo 121-132 

1* div, --i~, div. 

3 log 2 


2. Con» con, div, con, div. 


3. (o) c < 0 ; (5) con for c < 0 . 4. c®*® < n < 

3. ^ < « < ^, 1.08190 < « < 1.08267. One term; eight terms. 

Section 4, Page 124 

1. Biv, con, con, div, 3. Div, con, div, con, con. 3, Con, div, con, con. 

4. (6) No. For example, On n, bn n, Cn 1 -- n, dn 2 — n. 

log (Of) 

3. N > - 

log 10 

Section 3, Page 127 

1. Con, div, con, con for |x| < \/5, con. 2. Con, con for c > 1 only, div. 

S« Con, con, con, div. 

Section 3» Pnge 132 

1. Cond con: div, nbs con, abs con, abs con. 

2. Abs con: lx] < 1 , all x, |x| < 1 , |x| > 1 , —^3 < x < 4, x *« 0, (x — 2| < 1 , all x. 

Cond con: x *» 1 , never, never, x ** — 1 , x » —H, never, x «* —3. 

8 . 0.95. 

Section 7, Pages 137-138 

3. Unif con for (a) — ® < x < «; (b) |x| < c < 0 1; (c) | (2/7r)x — n| > c > 0, 

where n is the odd integer nearest to (2/t)x; (d) 1 < c < I x | < oc. 

4. Unif con for (o) —oo < x < «; ( 5 ) |x| < c < 0.1; (c) 2x/ir »■ odd integer or 
t( 2 /ir)x — n| > c > 0 , where n is the odd integer nearest to (2 /t)x; 

(<f) i^l><^>l<>rkl^c<l. 

3. Yes, no, no, yea. 6 . No. 

Section 8, Pages 142-143 

1. Con for — 1 < X < 1, — \/2 < x < \/2, all x, — 3 < x < 3, — < x < v/3. 

8. (b) tan-' a: - ^ 4. (ti) 2.72, 0.368. 


8. (b) tan-' x ^ 


4w -f“ 1 0 (n -f“ 1)(57 i -f- 6) 


[-t“__ 

a. 3n 4- 6 


Section 9, Pages 146-147 


, 2”(i - D" 


(2n + l)!* (2a+ 1)! ’ 

M c. a j( - .)■ - a - «■ '-gji- 

{rf)2 + *’-3+2(j - 1) +(I “ 1)*: 






>■ (.). r ; ft) S(-1)- 

nl n 4“ 1 



AKiavEna 
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». (a)- 1)»; (&) - i £ (l + ) (* _ 1)-; 

W - g E (l + 2{-l)- - (a - 1)». 

-5r+5'^^‘-’>-(SrTT)i(' -i)’ 


^2n ^4n+2 


-2w+l 


2 T ™, V 

^(2n)l ^(2n~hl)l 




n -f 1* 


Bection 10, Page 149 


1 . (a)S(-l)" 
(c)2(-l)* 
8 , 2(-l)“ 


3.2n+l 

(2n -f l)w! 
«4n+* 


:(f>)2Z 


(4n 4- 3)(2n -f 1)! 

j-p+n 

for p > 0 and all x. 


(2n -f l)(2n + 1)! ' 

~2n4-l 

;(rf)2(-I)»- 


(2n + l)(2n + 1)! 


nl(p + n) 

4 i + viiZifc^ 

'p'*'r nt(p + n)(-ir 


(9 - n) 


5 > 0, p 0, —1, —2, 


Section 11, Pages 162-168 

2 . (a) a + H*’ + Hs** H—; (6) 1 + a + 4—; 

(c) 1 4- H** + H*x* +---A<i)H+Hx- Hsx‘ +■■■; 

(e) 1 - Mx - +■■■;(/) I 

8. (a) 0.00133. 4. 4- .'J^a* 4-- • •• 

6 . 3.004, 0.985, 0.839, 2.036. 6. 0.310, 0.020, -1.025, 0.94. 

T. |«1 < 0.24 radian - 14°. 9. 

Section 12, Page 166 


, 2 “ 


1 - Z-7®”.l+a 4-22:-7.1. 


'nl 


8. sin" 


m2 HI 

•1'3 (2n - 1) 1 

2-4 •••2n 2n + r 


2. y - 1 -f ac* + k ^ 2. 


^2n+l 


Section 18, Page 169 

^2n ^2n-{ 1 


(6) 1 -f oo cos a; + ai sin x; 

l-4x® , 1-4-7X® 
■f--xr- + 


(c)oo(l+|i 


2 . (o) e*; (6) x - 1. 


6! 


9! 

+ • 


X 2x* 2-5x^ . 2°6°8^x 


+ -) + { 

* 3a‘ , 3-6a* . 3-6-9a« 


.).( 


-j-f., 

21 51 8! 


Ill 


8. ciS(~l)«°~-f cjZx", 

nl 


)• 


Section 14, Pages 166-^166 


6e On 


1 


(n -f p)n 


On~t. 


T. (b)J—(ci 

y wx 


(ci cos X + Cl sin x). 
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B9Ktim 17, Ptge 171 


a. 


sia (» -f l)a/2 sin (m/2) 


4u I2w. 


sin (x/2) 


X ^0, dk2r, db4ir, 


Soetioii 19, pRges 182-198 

»•-*■ + £ «<* (2n -1)® + E 2»u: + E (■ 


8 ' ‘ r2(2»-l) 

Section 18, Page 197 

1 , e, e, 0 , c, o, neither, «. 

Seetiofi 80, Pages 191-198 

o ^ y i)(y/2)g 

* ; T 2n + 1 
4. co6«‘;b. 


4n 


1 


2 T ^>-=^(271 - 1)* 


Section 88, Pai:e 804 

8. oi - - ;oj « 0;o8 - —• 
r 3ir 


CHAPTER 8 

Section 1, Page 219 

(o) Entire sey plane; (h) entire xy plane; (c) < 5; 

(d) 4* y* 5^ 0; W X 9^ 0; (/) (x - 1)* + y* < L 


Section 2, Page 823 

L (a) 9^, i :(6) x> + 


ar* a: ■ ' ' ' x* -f y* ' x® + y* ' 

(c) y cos xy 4-1, « cos xy; (d) c* log y, c*/y; 

1 


(e) 2xy 4- 


vr 


8 * (a) 2xy — **, x* + *, y — 2xs; (b) yz + -i x 2 4* -> xy; 

X y 

t —«x 


(c) 


sin'" 


Vy* — X** y-v/yi 1- X*' 

X y z 

Vx* + y* + **’ Vx^ 4- y® + a7x* -f y* + s* ' 
—X —y —z 

(x® 4“ y* -F (x* 4- y® + 2 ®)^' (x® 4* y® 4- *®)^* 


Section 8, Pages 827-888 

1. ir/6 ft®. 

4. 2,250. 

7. 0.112; 0.054. 

10. l.Oir; r. 


l)«~i A Bin nx. 
2 n 


cos (2n — l)irx. 


2. 11.7 ft. 

5. 10,85 
a. 53.78; 0.93. 


8 . 0.139 ft. 

8 . 98.64. 

9. 0.003r; 0.3 per cent. 



ANB^prawS 


J 


785 


StetixMi 4» IPaC# tSO 

1. asaa/o* + m/^ ^ 1- 
C oy 4* V2 a5. 

8, (o)e^(2«rin^-j-^ + ioo8^-Y-i): 

(6) 2r(l — Z tan* 0), — 6r* tan 0 sec* 0 , 

6 . (a) 2a5, 2(aJ 4- tan as sec* x); 

^ 

(6) cos <? — + sin ^ —I r f cos ^-sin 


S* x(a5 — ai^) 4* y(y8 — tmd 


dz 


dy 


5y 


dx/* dz 


Seetion 6, Page 2S6 


1. (o) y' - - 

(b)** 


sec y 4- 3x*y* 


cos z 


X sec y tan y 4- 2x*y * 


3**' 


cos a — 3a* 


2 . 


X* + 


i—j^xVx* + ~ i a V ’ A y'^^'‘ + + 

+ j/*\ dti/ x’ + »*\ du a«>/ 




5. du «■ 2x dx 4- 2y dy *■ 2r dr. 
C. /„ « —— (uy 4* 


w* + »* 


w* 4-f^ 


— t<x). 


Sectioii 8, Page 248 

1. (, + l)/V2 

8. H13-\/5 + !+«(! + -v/3)] * 6.811. 

Section 9, Page 249 

1. 0/3, 0/3, 0/3. a. 8o6<-/3 V3- 8. o/3, b/3, c/3. 

4 . -v/SP/CaVS + 3), (V3 + 1)P/2(2V5 + 3), P/(2y/Z + 3). 

6. i - ft - i -y^eor^r, d = VB1. 


Section 10, Page 264 

8. (o) 105°46', 90"; (6) I64°16', 90.“ 8. d/ Vo* + 6* + e*. 

Section 12, Pages 28(>-'281 

1. J + (j 0 ft + (» - l)ft + (r - l)ftft + ft* 

+ S‘'+T*’‘ + (‘+i)“’+S*’+ - 

where h •• z --J., k ^ y 

2 

a.«|i + (ft + ft) + ^ [ft* + 4ftt + ft*] + • • - j. ft - X - 1 , ft - V - ] 

8. 1 + X + (x* - »*) + i (i* - 3i»*) + ^ (x‘ - 6x*»* +y*) + • • 

Section 18, Pages 288-284 

^ IT sin (s-a/2) , cos (ira/2) — 1 ^ 

1, - 2e«r. 


mm- 
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». 2ac*. 


4. tan a, 

7. air(a* - l)-^. 


Sactloa 14, Page St69 

d 

®- ^ (ini') -gy-f “0. 

Seetfaio IS. Paces 87S-S77 

uhidudvdw. 

S. Ha*{r/2 - H). 


1. Hita*/h. 

8. 32o*/9. 

Sectten 17, Page >81 

1. ir«*/2. 

4. 8a* 


5. ududv. 

6. »(1 - e-^). 


2. 4a*(ir/2 - 1). 

6. 2 •• a cos* (a/2). 


CHAPTER 4 

Sectioa 2, Page 291 

2. A + B + C - 0. 8. A - H(S + D), B - J^(S - D). 

»• (a) iXi ; 2 ( |Af + ]bt)- 

Section 8, Pace 298 

1. (a) 6j, -6j; (b) A + B - 21 + 3J + 4k; (A + B) + C - 3i + 3j + 3k; 

B + C •• 2i + j; associative law; 

(c) 5i + lOj X 15k, -2i - 4j - 6k, 31 + 6j +■ 9k, 3i + 6j + 9k; 

(d) 3i + 6j + 9k, 3i + 3j + 3k, 6i + 9j + 12k; 

(e) -4. 

Section 4, Page 294 

1 . (o) 10, 2, 8; (6) 6, 4, i + 3j + k, 10; (c) 12; 

(d) cos-i 3/V5I; (e) 4/v'5; (/) s - 4; (») -i - j + k. 

2. (b) z —20; y ■” 8; * “ 1. 

Section 6, Pages 206-297 

1. (o) -2i + 3j - 4k, 51 -_4j + 3k, 3i - j - k, 2i + 3j + 3k, Si - j - k; 

(e) 131 + 2j + 2k; (d) y/lf7/2. 

Section 6, Pages 298-299 

2. (a) 0; (b) * - %; (d) 0, 0. 

Secti<ui 7, Pages 801-302 

1. (a) R'(t) - a + 6tj + 3<*k; (b) R'(l) - a + 6j + 3k; 

(e) T - a + 6j + 3k, |v| - 7. 

2. (a) V - R'«) - i + j cos t - k sin <; |R'(e)i - v^; 

(b) a - 2 -v/5. 

Section 8, Pages 808-806 

1. (a) W - 0; (b) W - -2; (c) - 4. 

8. (o) T - -i + 2j - 3k; (b) T - a + 4j + 4k. 
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». («) ▼ - fcA K B; (6) ▼ - JbC X (A - B). 

6. (a) R - XCM + 6J); (6) R(l,2) - H(4i + J). 

Sectfam 9, Page M8 

1. (a) B - 1 + 2J + 3k; (6) cob'* e/ViS; (c) B/VH- 

2. (a) i + 2J + 3k; (6) * - I + <;» - 2<: * - 1 + 3i; (c) V??; 

W + j + ?ik; (/) Hi ~ Hi + Hin. 

S. (o) R - a - 2j + {-4i + 4j - k)«; (b) -41 + 4j - k; 

(d) -ix + 4»-*4-c-0;(«)R--a+k + (-41 + 4j - k)U 
4, (o) IP - 3^ + 8; (5) t - 0; D - 2V2. 

8 . R - (1 + j + 3k)l, (.-H, -H, -H). 

Section 10, Page 811 

1, n - oi -f + ck. 

2. (a) ~i + 6j + 2k; (c) 2i + j + 3k + (~i + 6j + 2k)f - B. 

4. « co8~*^ 9 /\/To5. 

5, -16i + 8j-f-4k. 


Section 11, Pages 815-816 


1. (a) (-1 + j - k)/-v/3; (b) -I + (V + 2) - (* - 2) - 0; 
(c) V3 + log (1 + VU ). 

2. (a) ▼ - 2« + 2j + 21k; A - 21 + 2k; (b) v - iVW+l; 


,, V'4P + 2 „ i-2<j + k 

w * " Vixj: ^ “ - / .k . “o ’ • 


2(2P + D* ' 

8. Let R(0 ■“ (ajl* *1" ail ■!“ ao)i "f* (bjl* bjl + bo)j 4- (cal^ 4" eil 4* ai))k; then iMi, 
equation of the plane through the plane curve is 

(fciC2 btci)(x ~ oo) + (02^1 - oiC 2)(2^ - ^) -f (Ml ~ bia2)(z - Cb) « 0. 

6 . (fl) T - (1 4- 2i)/VE; W - (j - 2i)/v/5; 

(b) V = 1 4- 2j; A - 2j; (c) V, - ^/S; A, - i/y/h; 

(d) s' - Vl 4-41* ; s" - 4t/V 1 4- 41*. 

6. (o) A„ - 2/V5; (b) * - 2/(6-v/5). 


CHAPTER 6 

Section 2, Pages 866-867 

2. gii « 1, ^22 “ p*, gzz “ p® sin* gi 2 * g 2 a • gis ** 0, where p «* aci; 
d *• X2; 0 « xj. 

Section 8, Page 872 

1. At (1,2,3), Vtt « 2i -h 4j 4- 6k; du/dn •« 2\/T4; 

At (0,1,2), Vu - 2j + 4k; du/dn - 2V5. 

2. (o) -(ix + jy -4 kz)(x* -f J/* + 

(6) 2(ix 4 jy 4 k*)(x* 4 y* + **)■"*. 

8. « -3/V6. 

5. n « m - 2j 4 2k). 

8. du/ds ■« —3. 

Section 4, Pages 877-878 


4. du/ds - ~7/V5. 

6. H(2i - 2j ~ k), M(~2i 4 2j 4 k). 

9. dv/d8 - 6/-s/5. 


8 


3. (a) (6) (c) 
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4. Helical patli, 1; rectilinear path, ir*/S — 1. 

«. (a) W m (6) W m ^g, 

Section 5, Page 38S 

1. H- a. u(x,y) - 

4. (o) tt - x»*; (5) xyt. 8. 0; « - log r. 

6 , 0 , 

8. (i* +»* +*^-« - (4 + vl + 4)-**. 

Section 7, Page S88 

1. (a) 3; (b) 2/r; (c) 0. 2. 0, 2/r. 6. Su, 0. 

Section 8, Pagee 800-891 

2. 7raH>. 8. 4ra6c. 4. 4 ^ 0 '. 

Section 9, Pages 896-806 


1. (o) >«: W (c) 5 : (d) - v/§/4; (*) 1 %. 


3. (O) ▼ 

(b)y 


»(** + V*) + j2*v; u - **/3 + x»*; 

1 - / . , y .. 1 !/*-l 

(l+i)» (1 +1)*-'’“ “ 2 (1 + x)» * 


(c) V -■ ij/ COS a; + j sin x; li -• psin x; 

id) V - izy(l - **)“^ - j(l - a^)^i u - -y{i - 

(e) ▼ - i(» + 1) + j(y + 1); u - >i((* + 1)» + (v + 1)®]. 

8. («) 2ir; (6) a*-. 4. 6ir. j, g,. 


Section 10, Page 899 

2. (a) 0; (ft) 0; (c) 0. 


Section 11, Page 402 

2. —T, 8. 0. 4. 0. 

Sectton 12, Page 406 

2. w « j(xi/ - Ha^) 4- k[«(a; +1/) - + y^)l 

8. -u — xy® -f — X. 

4. w « -* Ha^z) 4- + A*). 

6. No. 


Section 18, Page 408 

du ^idu du 

1. v„-n- + -_ + k-. 

a. v«- _^_i. +_+ , 

(w* 4~ dti (w* 4" *^)t» u* 4~ p* du® ^ 
8. div F »■ —3p cos 0/r*, curl P * 0. 

Section 16, Page 414 


1 


a. Irrotational. 4. ♦ » ** - y*, hyperbolas. 

8. Irrotational and solenoidal. 10. — 3 ai*y; <, _ ^ 
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CHAPTffifl 6 

SectiMi 1, PagM 4S>-4S1 

a. (a) [!+(*- 0 <)*)-S -2(* - a<)[l + (* - oi)*]-*; 

(b) 2(1 + 2(1 + a*)-', (P - 21 + 2)-> + (P + a + 2)-*. 

3 . UgyU — U»tly » 0 . 

7. (a) Mv + /*(V + mtt). 

#■ (o) Pi(l/ - o®) + Pj(» + <**); ( 6 ) Pi(» - 2 x) + Pj(k + a); 

(c) Fi(i + tj/) + F!(» — »V)> (<0 P i(y + *) + »P»{y + a). 

10. (d) Fi(v - 6a) + Fi(i/ + ®) - yV60 + ®‘/6- 

11. (a) — + i(2y 4* ic) -f i^a(l/ — x); (6) — -f Fi(y — ox) + Fj(y -f ax); 

(c) -f H 2 I/’ + -- 2x) + Fiiv - X). 


Section 4, Page 440 

2 . w(0,r) *» — sin 2 air; n((0,ir) ■■ ^ cos 2 air. 

ifQ 

3. a ««* d= “ H—n «»■ 0 , dbl, dt2f .... 

15 5 

4. Conot + (aa)-’*-**. 8 . a«-^( 2 a* - 1 ). 

Section 6, Pages 446-446 

1. ±21, ±Zl, .... 


Section 6, Page 449 

2 . ( 6 ) 6 / 2 . 

Section 8, Pages 464-485 

1. 0.45 oscillation per sec. 
Section 9, Pages 468-469 
1. 2.07 X 10« cal/(m2)(day). 
4. (c) II - 


(- 

n-O 


• 1 )" 


(2n + 1)* 


^ (2n + l)»iB 

I 


Section 10, Pages 462-468 

4. (6) u{x,t) * (2n 


I 


X)x\ 


2 

Cn T I f(x) sin 7 (2n — l)x dx. 
L Jq I 

c 


^ _ nxat . nvx 

7 . Zan cos — j- sin -j- ; On ■ 

A Vt < . 

8 . 2/6« sm —Bin - 7 - ; bn " 

I I 

Section 11) Pages 466-467 
3. 0.44883, 0.14922, 0,00004. 
6. u(x,y) * — 2.; 


nxx , 
sin-y-ctx. 


2 

nra , 


gix) sin dx. 


—i— e-O—O-ar/io gin (2n - I) —• 
2n - 1 -^10 
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S. u(x,() “ ~~ ^2n~—j •’***^*"~'**^*'*** (2n ~ 

7. 36.6, 41.9. 

Bte&m IS, Paga 471 

-sfl 

2 v fo r* — 2 rR cos (^ — ^) -f /2* 


i2* - r* 




'0 LiZ® — 2i2r cos (a — + r® /2* — 2i?r cos (fl + 4>) + r® 

md4>. 




Sectkm 18, Page 474 

200 , 

1. [7-50-f — . 


60 f* 

2. I -j 

r Jq a* 


' (2n - l)a®"-i 
a® -r® 


^ sin (2n — 1)^. 


-.d4>. 


0® — 2 ar cos 4* 


4. u(x,y) “ 2 on sin mx sinh imy; a„ - -7-^^— / /(a;) sin ma; dx. 

Sinn rn yo 


Section 14, Page 479 

1\« 


Section 15, Page 482 

1. £ MKr) cos m ^ z{Ajnn cos Umni + Bmn sin ft>wnO; "mn - 7* -f 

2. SAnC"®®*«*/o(^n»*)» where 1 « 2A„Jo(^n»')- 

Section 18, Page 490 

a. (r<A)-« coshPi-d(. 

Jo 2ort 

8. f sinh;^d{. 

Jo 2ort 

4. (4ira>t)->< jT /(«) I f) [e-(*-l»i-.)*/(4<.*() ± j-(«-lnl+.)»/(««*()] j ^ 


Section 19, Page 498 

/ i-oo - ^ 

/ e-C(*-»,>•+(»-»,)»I 

-00 J -~oo 


, +00 f +00 
— 00 


5. n(x, 


' '■+“ f(yhtd 


“ i /_. 4r<»’(l - li) 
fi r-^ VViH dti. 

Jo [4ira®(t - <i)J^ 


g-[x*+(v-vi)*l /4a*(t-ti) 
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1. |fc|<2, -2, >2. 

2* Ee 0 pectiv«ly m, in or outside the unit otrole jp* 4- 1. 

». (o) Ice*' -f (1 - c)e'^ sin x. 

Section 26, Page 512 

2. At the lattice points in the region y > as, « > 0; y > 0. 

a. 


Section 27, Page 514 

1. 19* 


2 . 2 . 


a. 2* 


Section 28, Page 517 

' m 


1. - 






50 *■ n«,i n 100 

8. / - 0.6 + 1.1 E (-1)“ 006 

n»l 1,000 


Section 29, Page 519 

a. Ur, - 0; Urr - Oj l^rr + U„ - 0. 


CHAPTER 7 

Section 1, Page 588 

1. (a) 2, ir/3; (5) 2^5, (c) 2, ir; (d) 1, 3ir/2; 

(e) V2A 7ir/4; (/) 1, ir/2; (y) H, r/3; (/t) 4, ,r. 

2. (o) ->8; (6) -1 + t; (c) (2 -- V5 )/2 - /(2 -f \/3 )/2. 

8. (a) 1 ; (6) 1; (c) 1. 

4. COB <r/6) + ♦ sin (r/6), cos (t/6 4- 2»'/3) 4-1 Bin (r/6 -f 2r/3), 
cos (ir/6 4- 4ir/3) + t sin (r/6 4* 4r/3). 

7. (a) 1, 4- tv^), Hi -1 - tV3); (b) 1, t, -1. 

16. (a) Circle x* 4* « 1 ; (b) circular region x* 4- y* < 1; 

(c) re^on exterior to the circle x* 4- “ 1 including the boundary. 

18. (a) Circle radius 2, center at (1,0); (6) circle radius l/\/const, center at (0,0). 

Section 2, Page 585 

1. (a) (x® - y® - X 4* 1) 4- ii2xy ~ y); (6) x/(x* 4- y^ ~ ty(x* 4* y*); 

(d) (X* + y* - i)/(a:* 4- (y 4- D*] - i2x/lx* -f (y 4- l)*l; 

0x4“ t2y; (y) (x* 4“ y*) ^ 

2. (a) Open region x < 3, —« < y < <»; 

(6) The region y>l, —«o<x<oo; 

(c) The region exterior to the circle of radius 1 with center at the origin and in¬ 
cluding circular boundary; 

(d) Circular ring centered at the origin with interior radius 1, exterior radius %, 
including the boundary of the inner circle; 

(e) Open circular region with center at (1,0) of radius 1; 
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if) Gosed oiroulAr region of ntdius 1 with center Kito 

iff) Open region exterior to the circle of radiue 2 with center at (0, — 1). 

Section 8, Pages fiSMMO 

». («) H(e~* + *) COB 2 + - «) Bin 2; (c) e* *'* («) 

^ («) Hi* + «”*) on 1 + Hii* — «~*) CO* 1; 

(<J) «**^(008 2*j/ + 18in2a:v); 

(e) (cofl ly/i^ + y*)] »sin Iy/(** + y^)l. 

8. (o) log 4 + irt; (W log 5 -f tir/2; («) (/) e(coB 1 -f » sin 1); 

iff) 

6b (a) (r + 2Tk)i; (h) ir/2 -f 2rk - f log (2 db is/3); (d) 2rk, Jfe - 0, d=l, db2, ± • * • 

Section 4y Page 646 

8. (h) * «• —1; (c) * *• 0; (d) « t/ 2 + fc*-, A? “ 0, rbl, ±2,..(e) * «■ 1, r -* —1 

(/)f (y)# (^) nt all points; (t) a ■■ 0, (fc) c ■■ dbi. 

Section 6, Page 648 

1. H(2 -f Hi). 

8. 1 along rectUineaTi 1 + ♦/3 along parabolic. 

6b 0. 6. 0. 7. (-2). . 

Section 7, Page 666 

8. 2iit. 8. 2, upper half; —2, lower half. 

6. 0. 10. (o) 0; (6) 0; (c) -irt; (d) iri. 

Section 6» Page 669 

1b 2irt(8 - 130. «• W 0; (&) 2irt; (d) 0; («) 2irt. 

6. --21^. 


Section 9, Page 661 

1. ti S ~ Sav®. 

8. (o) * -f ly, (b) cosh y cos a; — t sinh y sin «; (d) «^(co8 y + » sin y). 


Section 10, Pages 664-666 

a. f 

iS5 n 

00 -n 00 

fl-i. 

nmml n 


Section 11, Page 669 


1. (o)i + 2 + 3*+ 4*®+ •••; 
z 

B , -io. _L J_4._L_ 

2!*‘■" 3U* 4I*» 
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S. Fimotions ore expressed in Laurent’s series. 

4. (a) - -i- _ 1 - (* - 1) - - D*-; 

e “* X 


2 3 1+2* 1+2* 

!****• . 1 


2 2 * ~ 2 * 2 * 


+ -+-i + ri + 


SMtion 12, Paeei 573-67S 


( 6 ) 1 --.+ 


2 ! 2 * 


11 

3! 


, residue (—M); 
, residue (0); 


(/) e* +* +^ +^ +jj^ H-, residue(J^O: 


* 212® 

(t) Residue — at 2 


(tf) 


2 ” 2 ! 2 * 3!2 


16 322 

4! 5t 
1, residue at 2 -• 


-* •••, residue(-^O: 

1 . 


6. No. 


Section 13, Page 674 

1. (a) -Tt; (c) 2irt73I; (d) ~8W/3; (/) 0^ 

2. 2Tt. 4. (o) 0; (b) 2trt. 

Section 18, Page 686 

1 . (a) COB X cosh y, sin x sinh y; ( 6 ) e* cos y, c* sin y; 

(d) log (** + y*)^, tan-‘ (y/x); (e) */(a^ + y*), -v/(i* + v*). 

2 . (c) v «■ e* sin y ~ x; (d) sinh a; sin y. 


CHAPTER 8 


Section 1, Page 612 

L Hs. 

8 . 

6. 81, 71, 7!/2. 

7. (a) K: (6) 0; (c) Ksi (<0 bi¬ 
section 2, Page 616 

1 . J4, h, H. H, h, H. H. 

6. Hi. 

Section S, Pages 621-622 

1. 33/16,660. 

8. Hi, Hi, Hi- 


2. Ho. H, Ho, Ha, 1,323/46,189. 
4815! 13147! „„47!6I 

52! ’ 8!52I ’ 52! ' 

12 2m + 2n — 4 

o. —--- 

n m wn 


2, Questions 1, 2, 8 can be answered. 

7. H. 


2 . 

4. *Ho. 



m 
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». H, Hi- 

7. Ho.X«. 

61 6 ! 18,781 

*’ 52* 62 ... 48’ 13‘ 


8 . n > Oog 2 )/(log 6 — log 5). 
. /3\‘ 1211... 8 


62.51 


48 


270,840 
62*61 •6049* 


10- 1 - n « 3. 
Section 4, Pnges 026-627 


1 . 8 ?». 2 . « epq. 

Q>) W 1. 4. H 2 + Hi "f Ho + * * • + H 

6 . 2 . 


Socticai 6 » Page 681 


1. ^ 2 , Hz. ^Hz. ^Hz. Hz. Hz^ 4. 2 ^ 3 . 


Section 6, Page 687 

1. 0.75, 0,60, F(x) - X* for 0 < X < 1, m - 0.794. 

2. Hr. 0.206. 8. Ha. 

4. 6. K. 

Section 7, Pages 640-641 

1. K«, 2 and 3, 2. (a) 125/3,888. (h) 2%48. 

8 . (0.66)^® -f 10(0.66)®(0.35) + 45(0.65)«(0.35)* *f 120(0.65)^0.36)*. 

4. 6(H)* + 4(H)*(H) + 45(H)^H)* -f 40(H)*(H)* + 15(H)HH)*. 

6 . 741/2,728. 7. 0.57,0.57, - - -• 

n n 


Section 8 , Page 644 

1. 0.499. 

Section 9, Page 660 

1. 0,039. 

8 . 0.979. 

6 , 0.083, 0.166, no. 

Section 10, Page 664 

1. 1,640. 

3. ^%o. 

Section 11, Page 668 

1. (a) 0.368, 0.402; (b) 0.368, 0.373. 
8 - 1,006. 

6 . 0.577. 


2 . i^( 1.5) ~ i>(~ 1 . 0 ) * 0.806. 
4. 0.0222. 


2 . 46,413/78,125. 
4. 0.91854. 


2. 0.768. 

4. Expected number «» 5. 


Section 12, Page 868 

2 . 0.82. 


Section 18, Page 667 

t H2. 

8 - H V5 - 0.577. 


6.83. 


2. H, H, «H.ye8,p « ^ 1 . 
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Section IS, PagM $TQMi71 

1. 7.5, 0.26. a. 0.146, 64. 

CHAPTER 9 

Sectfon 1, Page 679 

1. (a) -0.8 <xi< -0.7, - 2, - 4; 

(6) -0.8 <xi< -0.7, 1.2 < »2 < 1.3; 

(c) -0.8 < asi < -0.7, -0.6 < xa < -0.6. 1.0 < ai < 1.1; 

(d) -0,6 <xi< 0.6; (e) 4.4 < xi < 4.6. 

2. ;i « 1.23. 

Section 2, Page 684 

1. Prob. 1: (a) -0.76; (6) -0.73, 1.22; (c) -0.77, -0.66, 1.08; (d) -0.67; (e) 4.49: 
Prob. 2:1.226. 

2. -0.942, -0.200, 1.046. 

Section 8, Page 686 

3. 2.310 radians. 

4. (o) 0.739; (6) 0.667; (c) -0,725, 1.221; (d) 2.924; (e) 1.046, -0.942, -0.200. 
Secti(m 4, Pages 688-689 

(a) X « y - « ■■ -Ka; 

(h) ai « 1; as ■■ — 1; as « —2; a4 *■ 3; 

(0 ai « -0.107; xa « 0.988; ai - 0.317. 

Section 7, Pages 694-696 

1. 2 / « 0.25a* - 0.50a + 0.26. 

Section 8, Page 696 

2. 9.466, 12.549. 

Section 9, Page 700 

1. 2.784, 2.700. 2. If Sb - 60, ^ « 40.82, 42.62, 42.50. 

8. 2.581, 2,627. 4. 106.09. 

Section 10, Page 702 

1. (o) 1/ - H* + K; W V - 2.6a® *; (c) 0.3(10® **). 

Section 11, Page 711 

1. y - 4.98 - 3.13a 4* 1.26a*. 

8. IC - 1.778; h - 1.9349; 8 - 60.02(0.861)^ 

Section 12, Page 716 

1. y 0.75 0.10 cos a — 0,05 cos 3a — 0.29 sin a. 

2. y «« 0.85 - 0.26 cos 2a - 0.05 cos 4a + 0.06 cos 6a + 0.26 sin 2a - 0.03 sin 4*. 
Section 18, Pages 720-721 

L 25.252, 25.068. 

8. 128.6. 

6 . 39.30, 38.98. 


2. 132,137. 

4. 666.25, 666.00. 
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SvetioB 14. I>iiice 7SS 

1. y{±0.2) - 1; y(±0.4) - 1.08; y{±0.6) •• 1.061; »(±0.8) - 1.124; y(±1.0) 
• 1.214. The coirespoDding exact values are 1.010, 1.041, 1.094,1.174, 1.284. 

».»-»! + y'(x{)(x - »i) + — - y'(ao)l(* - ®i)*- 

2h 

». vi - 1.0100;»» « 1.0403; yt - 1.0927. 

Sectioa IB, PacM 726-787 

1. y, - 2.0442; vi - 2.3274; ys - 2.6509; y« - 8.0190; yw - 3.4363. 

8. y(0.3) - 1.3498; y(0.4) - 1.4917; y(0.5) - 1.6485; y(0.6) - 1.8218. 

4. 0.2740. 

Section 16| Page 7S0 

Is 1/ « a; + X* + -h -R 4 ;« * X + + Ri- 

2. 0.1, 0.2206, 0.3627, 0.6281. 

8. 1.01, 1.031, 1.063, 

is + + + 

« » X 4" 4“ 4“ H 2 ** 4— *• 

8. 1.0062, 1.0215, 1.0602. 


APPENDIX A 

Sectioa 1, Page 7i7 

1. (18, 0, -1, 0). 2. (18, 0, -1, 0). 8. 1. 

Section 2, Pages 762-788 

1. (a) (2, -1, 1); (6) (1, (c) (3, -1, 2); (d) (1, rl, -2, 3). 

2. (o) (--ifcA 6A;/7, A:); (6) (0, 0); (c) (0, 0, 0); (d) (k/4, 7k/S, k); (e) (k, 2k, 0); 

(/) (0, 0, 0). 

8. (a) (1, —1); (6) inconsistent; (c) inconsistent; (d) (1, 3k —2, k). 


APPENDIX B 

Section 2, Pages 78S-789 

2* y * cos 2x 4- H sin X — sin 2x. 8. j/ ** 4- “ x. 

8. y « 2 4- - xe~® - 3e~*. 6. y - c^. 

Secti<Hi 8, Pages 761-762 

1* V «* 2. y * 1, y « X, y - for x > 0. 

Secticm 4, Pages 764-766 

1. ^ cosh a? — ~ \/3 sinh x. 

3 V3 2 V3 

SL (a) cos x; (6) cosh x 4- sinh x « K 4" Ks”*®*; 

(c) e“’^*(co8h >^\/5 X — sinh H V6 x). 

Section 6, Page 767 
t y - 

JQ Jo 
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«. (a) H (b) - ««- + Me-**; 

(c) «*-«+«-- Ke"*’- 

Section 6, Paces 768-769 

+ ^cuL — t-(Mll)i _ Qjj;, cos u< + cR sin at with c •• 




8. I,-* fV(T)«~<*/«<“-^)(iT - R-» /V'(t)[ 1 - e-CS/iX*-^)] (Jr 

./» /o 







INDEX 


The letter p. after a page number refers 

Abers theorem, on differential equations, 
54p. 

on power series, 142 
Absolute convergence, of integrals, 766 
of series, 127, 170 

Absolute value of complex numbers, 168, 
528 

Acceleration, normal, 313 
tangential, 313 
vector, 302p., 313 
in cylindrical coordinates, 367p. 
Adams’ method, 723, 728 
Addition, of complex numbers, 166, 629 
of matrices, 327 
parallelogram law of, 288, 629 
of series, 117 
of vectors, 288, 317 
Adiabatic expansion, 60p. 

Algebraic equations, 677 
solution of, by graphical methods, 678 
by iterative methods, 679 
systems of linear, 350, 687, 689, 749 
Alternating series, 128 
Amplitude, of simple harmonic motion, 44 
of waves, 428 
Analytic functions, 540 
branch points of, 670 
Cauchy’s formula for, 555 
Cauchy's theorem for, 647 
differentiation of, 641, 567 
essential singular points of, 570, 674 
geometric representation of, 676 
integrals of, 545, 647, 551 
Laurent's expansion for, 666 
mapping by, 575-694 
maximum modulus theorem for, 658, 
561 

poles of, 570 
residue theorem for, 573 


to a problem, the letter n. to a footnote. 

Analytic functions, residues of, 670 
singular points of, 643, 569 
Taylor's series for, 661 
Angle, phase, 44, 528 
solid, 399p. 

Angular momentum, 305 
Angular velocity, 302, 399p. 

Antenna, radiation from, 486 
Arc length, 301 
in curvilinear coordinates, 362 
of an ellipse, 147 
Argand’s diagram, 528 
Argument of a complex number, 628 
Arithmetic means, 667 
Asymptotic equality, 12, 123 
Atmospheric pressure, 60 
(See also Pressure) 

Attraction, of a cone, 277p. 

Coulomb’s law of, 408n. 
of a cylinder, 277p. 

Newton’s law of, 46 
of a sphere, 47, 277p., 410 
Augmented matrix, 750 
Average, arithmetic, 667 
Average-value theorem, 498 


Base or coordinate vectors, 319 
in curvilinear coordinates, 863 
in cylindrical coordinates, 367p. 
orthonormal, 321 
in spherical coordinates, 367p. 
transformation of, 337, 867p. 

Basis, 319 

Beams, bending of, 16 
buckling of, 95p. 
cantilever, 16 

on elastic foundations, 86p., 94p. 
vibration of, 435p. 
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INDEX 


Bezi4mg moment, 16, 435p. 

Bernoulli's differential equation, 27 
BemoulU*‘Eiiler law, 16 
Bessers differential equation, 159 
Bessers functions, 162, 198, 480 
as 3 anptotic formulas for, 199p 
expansion in series of, 481 
generating function for, 166p. 
orthogonality of, 198 
seros of, 198 
Bessel’s inequality, 202 
Beta function, 149, 765 
Biharmonic equation, 430p. 

Bilinear forms, 349 
Bilinear transformation, 677p. 

Binomial distribution, 639 
generating function for, 640p. 
Binomial frequency function, 639 
Binomial law of probability, 639 
Laplace’s approximation to, 647 
normal approximation to, 647 
Binomial theorem, 155 
Binormal, 312 
Boltzmann constant, 633 
Boundary-value problems, 91, 442, 730 
Bounds for Fourier coefficients, 211 
Branch points, 570 
Buffon’s needle problem, 637 


Cable, flow of electricity in, 514 
hanging under gravity, 40, 454 
oscillations of, 445, 454 
supporting roadway, 42p. 

Calculus of variations, 264 
isoperimetric problems, 269 
problems with constraints, 269 
Cantilever beam, 16 
Cartesian reference frames, 321 
Catenary, 41 

Cauchy’s convergence criterion, 115 
Cauchy's differential equation, 78 
Cauchy’s inequality, 322 
Cauchy’s integral formula, 555 
Cauchy’s integral test, 120 
Cauchy’s integral theorem, 547 
Cauchy’s principal value of an integral, 
602 

CSauchy-Riemann equations, 413 
Oauchy-Schwarz inequality, 322 
<>iiter, of gravity, 275, 281 


Center, of mass, 44, 308 
motion of, 44, 304 
Chain under gravity, 40 
Chain rule, 228 

Change of variables, in functions, 237 
in integrals, 270 
Channel, flow from, 694 
Chaplygin’s method, 37 
Characteristic equation, 54, 67, 521, 733 
of a matrix, 344 

for systems of linear differential equa¬ 
tions, 100, 106p., 733 
Characteristic frequencies, 477, 479p., 481, 
482p. 

Characteristic functions, 732 
Characteristic values, 344, 507p., 732 
Characteristic vectors, 344 
Characteristics, 440, 508, 617 
discontinuities on, 519 
Chemical combinations, 14 
Chemical reactions, 15 
Circle of convergence, 170, 562 
Circulation, 397, 591 
Clairaut’s equation, 27p. 

Cofactors, 741 

Column, axially loaded, 86p., 90 
Euler’s critical load for, 92 
Combinations, 611 
Combinatory analysis, 611 
Comparison tests, for integrals, 755n. 

for series, 122, 125, 134 
Complementary function, 59 
Complex function, 634 
continuity of, 640 
differentiation of, 541 
integration of, 543 
Complex numbers, 166, 527 
absolute value of, 168, 628 
addition of, 166, 629 
argument of, principal, 628 
conjugate, 529 
modulus of, 528 
operations on, 528 
phase angle of, 528 
polar form of, 628 
roots of, 681 
Complex potential, 587 
Complex roots of unity, 532 
Complex variable, elementary function* 
of, 534 

Complex-variable theory, 523-604 
Components of a vector, 291, 317, 321 
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Composite functions, 230 
Compouxid probability, 617, 635 
Condenser discharge, 81, 84 
Conductivity, thermal, 415, 455 
Conformal mapping, 583, 598 
examples of, 675-595 
invariance of harmonic functions under, 
585 

Eiemann’s theorem on, 586 
Schwarz-Christoffel formula for, 699 
Conjugate complex numbers, 529 
Conjugate harmonic functions, 560, 588 
Conjugate matrix, 343 
Conservation, of energy, 43 
of matter, 417 
of momentum, 44 
Conservative force fields, 408 
Constraints in calculus of variations, 269 
Continuity, 218 
of complex functions, 540 
equation of, 412, 417 
piecewise, 765 
of scalar functions, 368 
of vector functions, 299, 368 
Contour integrals (see Line integrals) 
Convergence, circle of, 170, 562 
interval of, 139 
radius of, 139, 170, 562 
of series (see Series) 
uniform, 132 
Convolution, 488 
Convolution theorem, 488, 762 
Coordinate lines, 272, 369 
Coordinate surfaces, 359 
Coordinate vectors, 319 
Coordinates, affine, 366 
curvilinear, 357 
divergence in, 406 
gpradient in, 407 
volume in, 364 

cylindrical (see Cylindrical coordinates) 
orthogonal, 363 
parabolic, 408p. 
spherical, 360, 367p. 

Correlation coefficient, 666 
Coulomb's law, 408n., 467 
Couple, 305 
Covariance, 664 
Cramer's rule, 326, 749 
Cross product, 294 
Grout's reduction, 687n. 

Curl, 396 


Curl, in cartesian coordinates, 398 
in curvilinear coordinates, 406 
relation to rotation, d99p. 

Current flow, 416 
in cables, 514 

in electrical circuits, 81, 87, 1(X1, 76Ip,, 
768p. 

Curvature, 150, 311 
Curve, elastic, 16, 86p. 

Frenet’s formulas for, 311 
integral, 7 
length of, 301 
mininuzing, 265 

of minimum descent, 269p., 767 
motion on, 301, 313 
normal to, 311 

piecewise or sectionally smooth, 372 
pursuit, 33 
on a surface, 309 
trihedral associated with, 312 
Curve fitting, by finite differences, 694 
by graphical means, 701 
by least squares, 702 
by trigonometric functions, 711 
Curvilinear coordinates (see Coordinates) 
Cycloid, motion on, 46p., 269p., 767 
Cylindrical coordinates, 369 
acceleration components in, 367p. 
base vectors in, 367p. 
velocity components in, 367p, 
volume element in, 365 


Z>, 57 
V, 370 
V*, 387, 407 
A, 510, 692 
5, 761 

D'Alembert's solution of wave equation* 
439, 485 

Damped oscillations, 449 
Damping, viscous, 82, 449 
Definite integrals (see Integrals) 
Deformation of contours, 649 
Del, V (see Gradient) 
de Moivre's formula, 530 
Dependence, linear, 52, 70, 317 
Derivative, directional, 243, 253, 369 
(See also Gradient) 
normal, 244, 253, 369 
partial, 219 

Determinants, 325, 741-753 



destermiiiants, cofaetoi^ ol, 741 
differentiation of, 743 
expansion of, 326, 741 
ininors of, 741 
multiplication of, 825, 745 
solution of equations by, 326, 748 
Wronskian, 52, 54p., 71 
Difference equations, 510, 734 
Dirichlet’s problem for, 511 
elliptic, 511, 518 
hyperbolic, 513, 518 
parabolic, 5X2, 518 
Difference operators, 510, 692 
Differences, backward, 692 
finite, method of, 734 
forward, 691 
Differential, 223, 310 
approximations by, 226, 311 
of arc length, 362 
exact, 226, 380 
total, 226, 234 
of volume, 364 

Differential equations, elliptic, 505, 511, 
518 

Euler’s, 78, 267 
exact, 20 
hyperbolic, 507 
Lagrange’s, 26 

ordinary (see Ordinary differential equa¬ 
tions) 

parabolic, 506, 512 

partial (see Partial differential equa¬ 
tions) 

systems of, 96, 733 
Differential form, quadratic, 362 
Differential operators, 67, 430p. 
Differentiability, 226 

Differentiation, of analytic functions, 642 
chain rule for, 228 
of composite functions, 280 
of definite integrals, 261 
of determinants, 743 
of Fourier series, 210 
of implicit functions, 230, 235 
of infinite series, 135 
numerical, 698 
partial, 219 
of power series, 140 
of vector functions, 299 
Diffusion, 14, 416, 46dn. 

Diffuflivity, 463n, 

Dimensional analysis, 483 


Dipole, 408p., 496 
Dirac’s delta function, 761 
Dirac's distribution, 769 
Direction cosines, 370 
Directional derivative, 243, 253, 369 
(See also Gradient) 

Dirichlet’s conditions, 180 
Dirichlet's kernel, 205 
Dirichlet’s problem, 467, 502, 611, 696 
for arbitrary regions, 696 
for a circle, 469 
for a half plane, 484 
for a half space, 503 
Dirichlet’s theorem, 180 
Discontinuity, simple, 178 
Discrete distributions, 627, 628 
Discrete variables, 628 
Dispersion, 427 
Distribution function, 632 
Distributions, binomial, 639, 640p. 
bivariate, 666 
continuous, 631 
discrete, 627, 628 
Gaussian, 633 
Maxwell-Boltzmann's, 633 
normal, 633, 661 
Poisson’s, 633 
Divergence, 384 
in cartesian coordinates, 386 
in curvilinear coordinates, 406 
Divergence theorem, 388, 493 
Dot product (see Scalar product) 
Double layer, 497 
Dummy or summation index, 324 
Dynamics, laws of, 302 


c, 655n. 

173, 636 
e*, 172, 536, 581 

Eigenfunction (characteristic function) 
732 

Eigenvalue (characteristic value), 344 
507p., 732 

Eigenvector (characteristic vector), 344 
Elastic curve, 16, 8dp. 

curvature of, 16 
Elasticity, 599 

Electric circuits, 81, 87, 100, 756 
Electromechanical analogies, 81 
Electron, acceleration of, 46, 46p. 
mass-to-charge ratio, lOOp. 
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EkotroBtatie field, 692, 694 
Kleotroetifrtios, 467, 496 
KUipfle, length of, 147 
Elliptic differential equation, 605 
difference equation for, 5X1, 518 
Elliptic integrals, 49, 80p., 148 
Exnissivity, 463 
Empirical formulas, 701 
Energy, conservation of, 43, 48 
kinetic, 43, 306p. 
potential, 43, 264 
Envelope, 36 

Equation of continuity, 412, 417 
Error function (probability integral), 
table, 776 

Errors, estimate of, 658, 660 
Gauss’ law of, 662 
mean-absolute, 662 
mean-square, 662 
probable, 662 

in solving differential equations, 38, 
104 

theory of, 658 

Essential singular points, 570, 574 
Estimate, of errors, 658, 660 
maximum likelihood, 640, 660 
reliability of, 670 
unbiased, 640 
of variance, 669 
Euclidean space, 321, 374n. 

Euler’s critical load, 92 
Euler’s differential equation, 78 
invariational calculus, 267 
Euler’s formula, for exponentials, 173, 536 
for Fourier coefficients, 175, 196 
Euler's hydrodynamical equations, 419 
Euler's polygonal curves, 721, 727 
Euler's theorem on homogeneous func¬ 
tions, 234 

Euler-Pourier formula, 175, 196 
Even functions, 183 
Fourier expansion for, 184 
Events in probability, 610, 618, 619, 638 
Exact differential, 226, 380 
Exact differential equations, 20 
Expansion, adiabatic, 50p. 
of determinants, 326, 741 
Fourier, 175, 196 
Heaviside, 766 
Laurent, 564, 565 
Maolaurin, 144 
in power series, 144 


Expansion, in series of orthogonal funo 
tions, 201 
Taylor, 144 
Expectation, 623, 634 
of product, 663 
of sum, 624 

Expected frequency, 627 
Expected value, 623, 639 
Exponential function, 172, 536, 581 
Extrapolation formulas, 696 
Extreme values, 250, 264 
Extremum, 250 


Factor, integrating, 22 
Factorial, n!, approximation for, 644 
Factorial function, 162, 755 
(See also Gamma function) 

Falling bodies, 47 
Fermat's principle, 264 
Field, 367 
conservative, 408 
electrostatic, 467, 496, 592, 594 
gravitational, 409, 467 
irrotational, 402 
solenoidal, 402 
Field theory, 35^20 
Finite differences, method of, 734 
Flexural rigidity, 93 
Fluid flow, 411, 416, 587-595 
under dam, 593 
ideal, 419, 687, 692 
incompressible, 412, 418 
irrotational, 412, 588 
out of channel, 594 
solenoidal, 412 
stagnation points in, 690 
steady, 412, 587 
vortex in, 591 
Flux, 384 
Force field, 408 

electrostatic, 467, 496, 592, 694 
gravitational, 409 
Forced vibrations, 86, 461 
Fourier coefficients, 175, 196 
bounds for, 211 

Parseval's equality for, 202, 204p. 
Fourier expansion, 176, 196 
for odd functions, 185 
Fourier heat equation, 414 
Fourier integrid equation, 192 
Fourier integrals, 190, 194 
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Fourier series, 176,196 
complex form of, 192 
convergence of, 200, 204 
differentiation of, 210 
doable, 476 

for even and odd functions, 184 
extension of interval for, 187 
integration of, 207 
uniqueness theorem for, 186 
Fourier transform, 194, 482-490 
Free vibrations, 79, 432, 444, 446, 475 
Frenet-Serret formulas, 312 
Fresnel integrals, 147, 153p. 

Frequency, characteristic, 477, 479p., 
482p. 

relative, 615, 638, 642 
resonant, 89, 477 
Frequency equation, 478, 481 
Frequency function, 627 
binomial, 639 
Fuchs’ theorem, 167 

Fundamental theorem of integral calculus, 
9, 261, 550 


Gamma function, 149p., 162 
Gas, ideal, 221 
viscosity of, 451 
Gauss’ distribution, 633 
Gauss’ divergence theorem, 388 
Gauss’ law of errors, 662 
Gauss’ reduction method, 360n., 687 
GausaJordan reduction, 687n. 
Gaus£hSeidel method, 689n. 
Geometric series, 115 
Gradient, V, 244, 367, 390 
in cartesian coordinates, 370 
in curvilinear coordinates, 407 
Graeffe’s root-squaring method, 679n. 
Gram-Schmidt method, 351 
Graphical solution of equations, 678 
Gravitational attraction, 277p., 409 
motion under, 47, 49p. 
Gravitational constant, 46 
Gravitational field, 407 
Gravitational potential, 409 
Gravity* center of, 275, 281 
Gravity dam, 593 
Greatest lower bound, 773 
Green’s function, 601 
for half space, 502 
Oieen’s identities, 391p., 493 


Green’s theorem, in plane, 391, 402p. 

symmetric forms of, 391p., 493 
Growth factor, 418 


Harmonic analysis, 711 
Harmonic function, 468, 560, 685 
average value theorem for, 498 
conjugate, 660, 588 
differentiability of, 669 
maximum values of, 499p., 606, 668, 561 
Harmonics, 177 
Heat capacity, 455 
Heat equation, 414, 456 
solution of, by integrals, 482 
by separation of variables, 459 
by series, 465-471 
uniqueness of, 466, 506 
Heat flow, 414, 455-467, 483, 504, 612 
connection with random walks, 653 
in a rod, 456-466, 489, 769 
source function for, 491, 653 
in a sphere, 471 
Heat source, 489, 604, 663 
Heaviside’s expansion theorem, 766 
Helix, 314, 316p. 

Helmholtz formula, 499 
Hermitian form, 348 
Herrnitian matrix, 349 
Holomorphic function, 643 
Homogeneous differential equations (see 
Ordinary differential equations) 
Homogeneous functions, 18, 234 
Euler’s theorem on, 234 
Hooke’s law, 80 
Homer’s method, 679n. 

Hydrodynamics, 416, 419 
{See also Fluid flow) 

Hydrostatic pressure, 593 
Hyperbolic differential equation, 607 
difference equation for, 513 
Hyperbolic functions, 537, 691 
Hypergeometric equation, 166 

Ideal fluid, 419 

Images, method of, 448, 462p. 

Implicit functions, differentiation of, 280, 
235 

Improper integrals (see Integrals) 

ImpuliM function, 759 
Indefinite integral, 551 
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ludepeiidence, linear, 52, 70, 317 
of path, 378, 393 

Independent events in probability, 610, 
618, 638 

Indidal equation, 161 
Inertia, moment of, 274 
Infinite aeries (see Series) 

Inner product (see Scalar product) 
Integral calculus, fundamental theorem of, 
9, 261, 550 
Integral curve, 7 
Integral equations, 767 
Integrals, of analytic functions, 545, 547, 
561 

of Cauchy’s type, 557 
change of variables in, 270 
of complex functions, 545 
contour (see Line integrals) 
convergence of, absolute, 765 
differentiation of, 261, 262 
elliptic, 49, 86p., 148 
evaluation of, by fundamental theorem, 
9, 261 

by numerical methods, 717 
by residue theorem, 599 
by series, 147 
improper, 118, 553n., 602 
principal value of, 602 
indefinite,^ 551 
Lebesgue, 774 
line (see Line integrals) 
mean-value theorem for, 380 
multiple, 270 

particular (see Particular integrals) 
probability, 776 
Riemann, 771 
Stieltjes, 630 
surface, 277, 373 
transformation of, 382-402 
volume, 374 
Integrating factor, 22 
Integration, numerical, 716 
Interpolation, 679 
Interpolation formulas, 696, 699 
Interval, closed, 132n., 772n. 
of convergence, 139 
open, 218, 772n. 

Inverse elementary functions, 539p., 540p. 
Inversions, of matrices, 333, 350 
of order, 742 
Irrotationcd field, 402 
Irrotational flow, 412 


IsocHnes, 36 

Isolated singular points, 669 

Iterative methods, 679, 684, 689, 721-730 

/nW (see Bessel’s functions) 

Jacobian, 238, 242pii, 271 
Jump of a function, 620 

Kn(x) (see Bessel’s functions) 

Kinetic energy, 43, 306p. 

Kronecker delta, 321 

Lagrange’s differential equation, 26 
Lagrange’s interpolation formula, 699 
Lagrange’s multipliers, 250, 254 
Laplace transform, 754-769 
bilateral, 762n. 
convolution theorem for, 762 
of derivatives, 756 
of Dirac’s ‘‘function,” 769 
Heaviside’s theorem on, 766 
solution by, of differential equations, 
756-762 

of integral equations, 767 
tables of, 770 
unilateral, 762n. 

Laplace’s difference equation, 511, 736 
Laplace’s equation, 409, 413, 416, 419, 464, 
467, 735 

Laplace’s law in probability, 647 
Laplace-de Moivre limit theorem, 648 
Laplacian operator, 387 
in curvilinear coordinates, 407 
Laurent’s expansion, 564 
uniqueness of, 567 
Laurent’s theorem, 565 
Law, of errors, 662 
of large numbers, 650 
of mechanics, 302 
Newton’s (see Newton’s law) 
parallelogram, 288, 529 
of probability, binomial, 639, 647 
normal, 647, 663 
of reflection, 297p. 
of refraction, 297p. 
of small numbers, 654 
Least squares, 663p., 702 
connection with orthogonal functions, 
200 

curve fitting by, 702 
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Lebeflgue integrikl, 774 
Lebeague theorem, 775 
Legendre polynomiala, 160 
expansion in series of, 196, 473 
generating function for, 159p. 
orthogonality of, 198 
jElodrigues^ formula for, 159p. 
Legendre's equation, 168 
Leibnis' formula, for differentiation of 
integrals, 262 

use in evaluating integrals, 263 
Leibniz’ test, 128 
Length of arc, 301, 362 
of an ellipse, 147 
Level surface, 369 
Line, equation of, 306 
line integrals, 373 
of analytic functions, 547-654 
in complex plane, 545 
independent of path, 378, 393 
transformation of, 382-402 
of vector functions, 374 
Linear algebraic equations, 350, 687, 689, 
749 

Linear dependence, 52, 70 
of vectors, 317 

Linear differential equations (see Ordinary 
differential equations) 

Linear fractional transformation (bilinear 
transformation), 577p. 

Linear operators, 336, 754 
linear transformation, 332 
linear vector spaces, 316 
Linearity, property of, 61 
Lipschitz condition, 38 
Log z, 537, 681 

Logarithmic function, 537, 581 
principal value of, 537 
Lower bound, 773 


Mtest, 134 

Maclaurin’s formula, 260 
Maclaurin’s series, 144 
Mapping, by analytic functions, 575-594 
conformal (see Conformal mapping) 
Mass, center of, 303 
motion of, 44, 304 

Matrices, algebraic operations on, 827-331 
inversion of, 333, 350 
product of, 328 
transformation of, 340*350 


Matrix, 327, 749 
augmented, 750 
characteristic equation of, 344 
characteristic values of, 344 
conjugate, 343 
determinant of, 750 
diagonal, 329, 339, 343 
Hermitian, 349 
identity, 329 
inverse of, 333 
orthogonal, 340 
rank of, 380, 750 
scalar, 329 
singular, 331 
square, 327 
symmetric, 347 
transpose of, 334 
unit, 329 

unitary, 343p., 350 
zero, 329 

Maxima and minima, 246 
absolute, 247 
constrained, 249, 269 
relative, 247 

(See also Calculus of variations) 
Maximum modulus theorem, 558 
Maximum principles, 606, 607 
Maxwell-Boltzmann distribution, 633 
Mean errors, 200, 669, 660 
reliability of estimate of, 670 
Mean-value theorem, of differential cal¬ 
culus, 224 
for integrals, 380 
Measurable set, 773 
Measure, 772 

Measure numbers, 291, 319 
Measure theory, 614, 772 
Mechanics, laws of, 302 
Median value, 637p. 

Membrane, under gas pressure, 482p. 
vibration, of circular, 480 
of rectangular, 474 
Metric coefficients, 360 
Minima (see Maxima and minima) 
Minimax, 249 
Minimizing curve, 265 
Minimum descent, curve of, 269p., 767 
Minimum potential-energy principle, 264 
Minors of a determinant, 741 
Modes, 477, 481 

Modulus of a complex number, 528 
Moment, bending, 486p. 
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Moment, of difKile, 497 
of fm'oe, 303 
of inertm, 274 
of momootum, 305 
Momeatum, angular, 305 
linear, 42, 44 
moment of, 305 
Momentum vector, 305 
Monte Carlo methode, 652 
Multiple integrals, 270 
Multiply connected region, 383 
Mutually exclusive events, 610, 619 


Nabla or del, V, 370 
Neighborhood of a point, 540 
Neumann’s function, 508 
for half plane, 504 
Neumann’s problem, 503 
Newtonian potential, 277p., 409 
Newton’s interpolation formulas, 696 
Newton’s law, of attraction, 46, 409 
of cooling, 461 
of gravitation, 46, 408 
of motion, 42, 43 

Newton's method of solving equations, 684 
Nodal lines, 477 
Nodes, 429, 445 
Normal, to a curve, 311 
principal, 311 
to a surface, 309, 369 
Normal acceleration, 313 
Normal derivatives, 244, 369 
Normal distribution, 633, 651 
bivariate, 666 
Normal equations, 703 
Normal law of probability, 647 
interpretation of, 658 
Normal line, 309 
{See also Normal) 

Numerical analysis, 673-736 
Numerical differentiation, 698 
Numerical integration, 715 
Numerical solution of differential equa¬ 
tions, 37, 721-736 


Odd functions, 188 
Fourier expansion for, 185 
Operator, curl, 398, 407 
A 57, 430p. 

V, 370, 386, 407, 692 


Operator, V®, 387 
4, 510, 692 
difference, 510, 692 
div, 386, 406 
Fourier transform, 482 
Laplace, 387 
Laplace transform, 754 
linear, 336, 754 

Order, of differential equations, 6, 29, 76, 
425 

Interchange in partial differentiation, 
221 

inversions of, 742 
reduction of, 29, 76 
Ordinary differential equations, 1-106 
Abel’s theorem for, 54p. 

Bernoulli’s, 27 
Bessel's, 159 

boundary-value problems in, 91, 730 
Cauchy’s, 78 

Chaplygin’s method for, 87 
characteristic equation for, 54, 67, 101 
Clairaut’s, 27p. 

with constant coefficients, 54, 66, 100 
of electric circuits, 81, 100, 761p. 
Euler’s, 267 
Euler-Cauchy's, 78 
exact, 20 

existence and uniqueness theorems for, 
5, 7, 38, 157 
first-order, 17-50 
linear, 23, 51, 59 
Fuchs’ theorem on, 157 
Gauss’ hypergeometric, 166p, 
homogeneous, first-order, 18 
linear, 51, 54, 59, 96 
systems of, 100, 733 
hypergeometric, 165 
indicial equations for, 161 
initial-value problem for, 9, 90, 730 
integral curves for, 7 
integrals of, 7 
integrating factors for, 22 
integration between limits, 13 
isoclines for, 36 
Lagrange’s, 26 
Legendre’s, 158 

linear, complementary function for, 59ii. 
with constant coefficients, 54, 66 
systems of, 95, 733 

with variable coefficients, 51, 59, 
70, 153 
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Ordinaiy differentia] equations, order of, 
6, 425 

reduction of, 29, 76 
with separable variables, 18 
solutions of, 6 
general, 10, 52, 59, 67, 102 
by Laplace transform, 766-768 
linearly independent, 7, 59, 72, 76p. 
by numerical method, 87, 721-786 
particular, 7, 59, 72, 76p. 
by power series, 153 
singular, 8, 34 
stability of, 103, 105 
uniqueness of, 7, 38, 157 
systems of, 95, 110, 727 
characteristic equation for, 101, 783 
Origin, 288 

Orthogonal coordinates, 363 
Orthogonal curves, 31 
Orthogonal matrices, 340 
Orthogonal sets of functions, 195 
completeness and closure of, 203 
expansion in series of, 201 
relation of least squares to, 202 
Orthogonal trajectories, 30 
Orthogonal transformations, 340 
Orthogonal vectors, 319 
Orthogonality weighted, 197 
Orthogonalization, of matrices, 340, 350 
of vectors, 320 

Orthonormal functions, 195, 197 
Orthononnal vectors, 320 
Oscillations, of cable, 445, 454 
damped, 449 
period of, 44, 81, 84 
of spring, 80, 82, 86, 88, 89 
Osculating plane, 311 


Parabolic coordinates, 408p. 

Parabolic differential equation, 506 
difference equation for, 612 
Parabolic mirror, 33 
Parallelogram law of addition, 288, 529 
Parseval's equality, 202, 204p. 

Partial differential equations, 5, 425-521 
boundary conditions for, 443 
canonical forms of, 517 
characteristic values for, 507p. 
characteristics for, 441, 521 
of elliptic type, 504 
of heat flow 414, 4^ 


Partial differential equations, of hydrody¬ 
namics, 416, 429 
of hjrperbolic type, 504, 616 
of parabolic type, 504, 616 
of potential theory, 409, 41 In. 
solutions of, by Fourier transform, 
482-490 

fundamental, 521 
by integrals, 482-504 
by Laplace transform, 769p. 
numerical, 734 
by series, 448-482 
uniqueness of, 505-510 
of vibrating membranes, 475, 480 
of vibrating rods, 435p., 485p. 
of vibrating string, 431, 484 
of wave motion, 428 
Partial differentiation, 219 
interchange of order in, 221 
Partial sum of series, 111 
Particular integrals, 7 
by method of undetermined coeflScients, 
59 

by variation of parameters, 72 
Pendulum, 48, 49, 85p. 

Period, of oscillations, 44, 81, 84 
of pendulum, 49, 86p. 
of vibration, 443 
of waves, 428 
Permutations, 611 

Phase of simple harmonic motion, 44 
Picard’s method, 609 
Piecewise continuity, 765 
Piecewise smoothness, 206, 372 
Plancherers theorem, 483n. 

Plane, equation of, 306 
osculating, 311 
tangent, 277n., 309 
Point at infinity, 577 
Point set, 773 
measure of, 774 
Point vortex, 691 

Points, in nniimensional space, 316 
in sample space, 613 
Poisson’s distribution, 633 
Poisson’s equation, 268, 41 In., 495 
uniqueness of solution of, 499p. 

Poisson's formula, 495 
for a circle, 470, 699p. 
for a half plane, 490 
for a half space, 503 
Poisson's law of probatnlity, 654 
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Poles of analytic functions, 670, 674 
residues at, 671 
simple, 670 

Polynomial representation of data, 694, 
702 

Potential, complex, 587 
electrostatic, 467, 496 
gravitational, 409, 468 
Newtonian, 277p,, 409 
Potential energy, 43 
principle of minimum, 264 
Potential theory, 468 

{See also Dirichiefs problem; Neu¬ 
mann's problem) 

Power series, 138 
Abel’s theorem on, 142 
convergence of, absolute and uniform, 
139 

radius of, 142, 170, 562 
differentiation and integration of, 140 
evaluation of integrals by, 147 
expansions m, 144, 561 
multiplication of, 142 
solution of differential equations by, 153 
substitution in, 152 

uniqueness of representation by, 141, 
146 

Precision constant, 662, 670 
Pressure, atmospheric, 50 
on gravity dam, 693 
in star's interior, 48 
Primitive, 551 
Principal argument, 528 
Principal normal, 311 
Principal value, of improper integrals, 602 
of log z, 537 

Probability, 610, 614, 652 
binomial law of, 639, 647 
compound and total, theorems on, 617, 
635 

events in, 610, 618, 619, 638 
Laplace's law in, 647 
law of large numbers, 651 
law of small numbers (Poisson's), 654 
marginal, 625 
normal law of, 647, 653 
Probability density, 632, 634 
Gauss', 633 
joint, 634 

Maxwell-Boltzmann's, 633 
Poisson's, 633 

Probability integral, table of, 776 


Probable error, 662 
Probable value, 635 
Product, of determinants, 326, 745 
of matrices, 328 
of vectors, 293, 295, 298 
Projectiles, 50 

Pulley, slipping of belt on, 11 
Pursuit curves, 33 
Pythagorean formula, 322 


Quadratic forms, 347 
differential, 362 
positive definite, 349 
Quadrature, 715 


Radiation from antenna, 486 
Radiation condition, 501 
Radius of convergence, 139, 170, 662 
Random molecular motions, 653n. 
Random process, 622 
Random variables, 623, 662 
Random walks, 653 
Rank of a matrix, 330, 750 
Ratio test, 125 
Rational numl>er, 772 
Reflection, law of, 297p, 
transformation of, 341 
Refraction, law of, 297p. 

Regions, bounded, 535 
closed, 218, 535 
connected, 383 
multiply, 383, 535 
simply, 383, 535 
finite, 535 
open, 218 
regular, 383 

Relative frequency, 615, 638, G42 
Remainder in Taylor's series, 144 
Residuals, 703 
Residue theorem, 573 
evaluation of real integrals by, 599 
Residues, 571 
Resoiiauce, 86, 89, 453 
Resonant frequency, 89, 477 
Riemann function, 508 
Riemann integral, 771 
Riemarm’s mapping theorem, 586 
Rocket, motion of, 45, 46p. 
thrust, 45 

Rodrigues' formula, 159p. 
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Roots of unity, 532 

Eotation, of slioft, oritieal speed of, 94 
transfonuation of, 341 
velocity of, 399p. 


Sample space, 613, 634 
Soali^, 287 
Scalar fields, 367 
Scalar product, 293n., 319 
Scalar triple product, 297 
Schwars* inequality, 322 
Sohwars-Chriatoffel mapping formula, 599 
Seidel's method, 689 
Separation of variables, 18, 459, 732 
Series, 111 
addition of, 117 
alternating, 128 
basic properties of, 116 
binomial, 155 
Cauchy’s criterion for, 115 
comparison tests for, 122, 125, 134 
of complex terms, 169 
convergence of, 112 
absolute, 127, 170 
conditional, 129 
fundamental principle for, 114 
in the mean, 200 
pointwise, 204 

tests for, comparison, 113, 122 
integral, 118 
Leibniz’, 128 
ratio, 125 
uniform, 132, 136 
Weierstrass test for, 134 
differentiation of, 135 
evaluation of integrals by, 147 
Fourier (see Fourier series) 
geometric, 115 
harmonic, 113 
integration of, 135 
Iiaurent's, 565 
Maclaurin’s, 144 
multiplication of, 131 
of orthogonal functions, 195 
power (see Power series) 
rearrangement of, 129 
remainder in, 113 

solution of differential equations by, 
153, 465 
sum of, 112 
partial, 111 


Series, Taylor’s, 144, 561 
telescoping, 116p. 
trigonometric (see Fourier series) 

Shaft, critical speed of rotation, 94 
Shearing load, 435p. 

Significance level, 649 
Similar transformations, 339 
Similitude, principle of, 433 
Simple closed curve, 383n, 

Simple harmonic motion, 44 
Simple pendulum, 48 
Simply connected region, 383 
Simpson’s rule, 717 
Singular integral, 493 
Singular points, 643, 669 
essential, 570, 574 
isolated, 569 

Singular solutions of differential equa 
tions, 34 
Sink, 384, 591 
Solenoidsl field, 402 
Solid angle, 399p. 

Solution of differential equations (se 
Ordinary differential equations; Par 
tial (liffei'ential equations) 

Sommerfeld's radiation condition, 601 
Sound, equation of propagation of, 420 
velocity of, 435p. 

Source, 384, 591 
of heat, 489, 504, 653 
Space, complex, 322 
dimensionality of, 317 
Euclidean, 321, 374n. 
linear vector, 316 
sample, 613, 634 

Space curves (Frenet-Serret formulas), 315 
Specific heat, 416 
Spectral theory, 199 
Spheric^il coordinates, 360, 367p. 

Spring, oscillation of, 80, 82, 86, 88, 99 
Stability, of columns, 91 
of rotating shafts, 92 
of solutions of differential equations, 
103, 106 

Stagnation points, 590 
Standard deviation, 664, 669 
of the mean, 667 
unbiased estimate of, 670 
Statistical hypothesis, 614 
relation to significance level, 649 
Steady*«tate solutions, 88, 765 
Steady-state temperature, 457, 471 
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Stieltjefi integral, 630 
Stirling’s formula, 644 
Stokes' theorem, 400, 402lp. 

Stream function, 413 
Streamlines, 413, 414^,, 587 
String, vibration of, 425, 431-454, 484 
Sturm-Liouville theory, 199 
Summation convention, 324 
Superposition principle, 766 
Surface, level, 369 
normal to, 309, 369, 373n. 
one-sided, 383 
piecewise smooth, 277, 383 
regular, 384 

tangent plane to, 277n., 309 
two-sided, 277, 373n., 383 
Surface integrals, 277, 373 
Systems of equations, differential, 95, 738 
linear algebraic, 350, 687, 689, 749 


Tangent line, 301, 311 
Tangent plane, 277n., 309, 311 
Tautochrone, 767 
Taylor’s formula, 143 
approximations by. 149 
for functions of several variables, 257 
Taylor’s series, 144, 661 
Telegrapher’s equation, 514 
Thermal conductivity, 415, 465 
Torque, 303 
(See also Moment) 

Torsion, 312 

Total differential, 226, 234 
Total probability, 618, 635 
Transcendental equations, 677, 679. 684 
Transforms, Fourier, 194 
Laplace, 764 
Trapezoidal rule, 717 
Trigonometric functions, 172, 636 


Undetermined coefficients, method of, 69 
Uniform convergence, 132 
Uniqueness, of representation, in Fourier 
series, 186 

in Laurent's series, 667 
in power series, 141, 146 
of solutions, of ordinary differential 
equations, 7, 38, 157 
of partial differential equations, 505- 
510 


Unit impulse, 750 
Unit vectors (see Base vectors) 
Unitary matrices, 34dp., 350 
Unitary transformations, 343p. 


Value, absolute, of complex numbers, 168, 
528 

characteristic, 344, 507^., 732 
expected, 623, 639 
extreme, 250, 264 

maximum, of harmonic function, 499p, 
506, 558, 561 
median, 637p. 

principal, of improper integrals, 602 
of logarithmic function, 537 
probable, 635 
Variance, 664, 669 
Variation, 265 
of parameters, 72 
Vector, velocity. 301, 367p. 

Vector acceleration, 302p., 313, 367p. 
Vector analysis, 285-361 
(See also Vector field theory) 

Vector field theory, 355-420 
Vector functions, continuity of. 299, 368 
line integrals of, 374 
Vector product, 296 
Vector spaces, 316, 323 
Vectoi’s, algebraic operations on, 288-298, 
316-324 

base or coordinate (see Base or coordi¬ 
nate vectors) 
bound, 288 
characteristic, 344 
components of 291, 317, 321 
continued products of, 297 
coordinate, 319 
differentiation of, 299 
free, 288 

linear dependence of, 317 
magnitude of, 288 
momentum, 305 
orthogonal, 319 
orthonormal, 320 
parallelogram law for, 288 
product of, 293, 295, 298 
sliding, 288 
unit, 319 
zero, 289, 317 
Velocity, angular, 302 
of escape, 47 



IKPS^ 


812 

Vdocity, of rotation, 2Q9p. 
of sound, 4B5p, 
in wave motion, 434 
Velocity potential, 418, 419, 584, 587 
Velocity vector, 301, 367p, 

Vibration, of beams, 435p. 
of membranes, 474, 480 
period of, 443 
of string, 425, 431-454, 484 
Viscosity of gases, 450 
Viscous damping, 82, 86, 87, 449 
Volume integral, 270, 374 
Vortex, 691 


Wave equation, 428, 432, 484, 499, 508 
with damping, 449, 452 
solution of,, D’Alembert’s, 439, 485 
Fourier integral, 484 


Wave equation, solution of, Fourier series, 
448 

by separation of variables, 463p. 
uniqueness of, 439, 442, 508 
Wave front, 486, 519 
Waves, 425, 436 
amplitude of, 428 
period of, 428 
plane, 488 
shock, 520 
standing, 428 
Weierstrass M test, 134 
Work, 302, 306p., 409, 418 
Wronskian determinant, 52, 54p., 71 


Zero vector, 289, 317 
Zonal harmonics (see Legendre poly¬ 
nomials) 
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